An Incident Response Playbook for the Cloud Era
Cloud incidents move faster, span more accounts, and produce more evidence than on-prem ever did. Your playbook should reflect that.
The first ten minutes of a cloud incident determine the next ten days.
Pre-commit before the incident
- A break-glass account in every cloud tenant, with hardware-backed credentials stored offline.
- A read-only forensics role pre-deployed to every account.
- Immutable log destinations the production account cannot delete from.
The first ten minutes
- Declare. Open a dedicated channel. Assign an incident commander.
- Contain. Revoke suspected credentials before you finish investigating.
- Preserve. Snapshot affected resources before anyone is tempted to "just restart it".
After the dust settles
Run a blameless postmortem within five business days. Publish the timeline internally. The lessons compound; the blame does not.
