First 10 Minutes¶
When an incident starts, these checklists help you quickly stabilize and identify the scope of the issue.
Response Strategy¶
A structured response in the first minutes prevents chaotic troubleshooting and speeds up resolution.
| Step | Action | Description |
|---|---|---|
| 1 | Scope | Determine if the issue is global, regional, or specific to a user/resource. |
| 2 | Check Health | Review the Azure Service Health dashboard. |
| 3 | Triage Channel | Use the channel-specific checklists below. |
| 4 | Gather Evidence | Collect recent logs and metrics. |
Quick Checklists¶
- SMS Delivery Checklist - For issues with outbound SMS.
- Email Delivery Checklist - For email bouncing or verification issues.
- Chat Connectivity Checklist - For real-time messaging failures.
- Calling Quality Checklist - For voice and video issues.
Immediate Questions to Answer¶
- Did anything change in the environment recently? (Code deploy, configuration change)
- Is the issue reproducible?
- Are all users affected, or just a subset?
- Is there an ongoing Azure region outage?
See Also¶
Sources¶
- Azure Incident Response Framework
- Support Engineering Triage Guide