Skip to content

First 10 Minutes

Fast triage checklists for the first 10 minutes of an Azure Monitor incident.

Use these pages before opening a full playbook when you need to quickly classify whether the problem is data ingestion, alert delivery, cost growth, or query performance.

flowchart TD
    A[Incident signal] --> B{Primary symptom}
    B --> C[No data]
    B --> D[Alert not firing]
    B --> E[High cost]
    B --> F[Query timeout]
    C --> G[Validate ingestion path]
    D --> H[Validate signal, rule, action group]
    E --> I[Validate top tables and sources]
    F --> J[Validate scope and query shape]

Checklists

Checklist When to Use
No Data Logs or metrics are missing, stale, or never arrived in the workspace
Alert Not Firing Threshold was crossed but no alert or notification appeared
High Cost Workspace ingestion cost or daily GB suddenly increased
Query Timeout Logs, workbooks, or scheduled queries are slow or timing out

Suggested First Pass

  1. Pick the checklist that matches the user-visible symptom.
  2. Confirm whether the issue is broad or isolated to one resource, table, or rule.
  3. Collect one KQL signal and one control-plane signal before changing configuration.
  4. Escalate to the linked playbook when the first-response path narrows to one or two hypotheses.

See Also

Sources