Skip to content

Monitoring and Alerting

Track storage health, performance, and availability metrics.

Metric Category Description
Availability Health Percentage of successful requests.
Latency Performance Time taken to process requests.
Transactions Load Number of storage operations.
Egress/Ingress Data Volume of data moved in/out.
Capacity Usage Total used storage space.

Tip

Set alerts on "Availability < 99%" and "E2E Latency > Threshold" for early incident detection.

mermaid graph TD A[Monitor] --> B[Metrics] B --> C[Alerts] C --> D[Email/SMS] A --> E[Logs] E --> F[Log Analytics]

Monitoring Checklist

  • Define baseline thresholds for latency and availability.
  • Configure metric alerts for 429, 503, and latency spikes.
  • Route alerts to on-call channels and ticketing systems.
  • Enable diagnostics for blob, file, queue, and table services.
  • Retain logs long enough for incident and trend analysis.
  • Review capacity and transaction growth monthly.

See Also

Sources