Incremental, timed rollout
Description
To reduce operational risk when rolling out new code, a best practice is to slowly, incrementally roll out code changes to a fleet, pausing in between certain breakpoints, and optionally allowing for metrics to pause or abort a rollout.
Proposal
- Pick a good rollout standard such as 1%, 5%, 10%, 20%, 50%, 100%
- Roll out to each group amount, then wait for specific time, then continue
- Ability for manual abort of rollout
- Future: Ability for monitoring metrics to automatically abort rollout