First 10 Minutes¶
Fast triage checklists for the first 10 minutes of an Azure Monitor incident.
Use these pages before opening a full playbook when you need to quickly classify whether the problem is data ingestion, alert delivery, cost growth, or query performance.
flowchart TD
A[Incident signal] --> B{Primary symptom}
B --> C[No data]
B --> D[Alert not firing]
B --> E[High cost]
B --> F[Query timeout]
C --> G[Validate ingestion path]
D --> H[Validate signal, rule, action group]
E --> I[Validate top tables and sources]
F --> J[Validate scope and query shape] Checklists¶
| Checklist | When to Use |
|---|---|
| No Data | Logs or metrics are missing, stale, or never arrived in the workspace |
| Alert Not Firing | Threshold was crossed but no alert or notification appeared |
| High Cost | Workspace ingestion cost or daily GB suddenly increased |
| Query Timeout | Logs, workbooks, or scheduled queries are slow or timing out |
Suggested First Pass¶
- Pick the checklist that matches the user-visible symptom.
- Confirm whether the issue is broad or isolated to one resource, table, or rule.
- Collect one KQL signal and one control-plane signal before changing configuration.
- Escalate to the linked playbook when the first-response path narrows to one or two hypotheses.