Monitoring and Alerting¶
Track storage health, performance, and availability metrics.
| Metric | Category | Description |
|---|---|---|
| Availability | Health | Percentage of successful requests. |
| Latency | Performance | Time taken to process requests. |
| Transactions | Load | Number of storage operations. |
| Egress/Ingress | Data | Volume of data moved in/out. |
| Capacity | Usage | Total used storage space. |
Tip
Set alerts on "Availability < 99%" and "E2E Latency > Threshold" for early incident detection.
mermaid graph TD A[Monitor] --> B[Metrics] B --> C[Alerts] C --> D[Email/SMS] A --> E[Logs] E --> F[Log Analytics]
Monitoring Checklist¶
- Define baseline thresholds for latency and availability.
- Configure metric alerts for 429, 503, and latency spikes.
- Route alerts to on-call channels and ticketing systems.
- Enable diagnostics for blob, file, queue, and table services.
- Retain logs long enough for incident and trend analysis.
- Review capacity and transaction growth monthly.