Skip to content

Troubleshooting Decision Tree

Use this page when you need to route from a storage symptom to the most relevant playbook in the first few minutes.

Main triage tree

```mermaid flowchart TD A[Incident begins] --> B{What is the dominant symptom?} B -->|Cannot connect or mount| C[Access branch] B -->|403 or token problem| D[Security branch] B -->|Slow or throttled| E[Performance branch] B -->|Missing overwritten deleted data| F[Recovery branch]

C --> C1{DNS or endpoint path issue?}
C1 -->|Yes| P1[Private Endpoint and DNS Issues]
C1 -->|No| C2{File share mount specific?}
C2 -->|Yes| P2[File Share Mount Issues]
C2 -->|No| C3{Public vs private confusion?}
C3 -->|Yes| P3[Public vs Private Access Confusion]
C3 -->|No| P4[Cannot Access Storage Account]

D --> D1{Using SAS or temporary token?}
D1 -->|Yes| P5[SAS and Token Issues]
D1 -->|No| P6[Authorization Failures]

E --> E1{429 or 503 present?}
E1 -->|Yes| P7[Throttling and Performance Issues]
E1 -->|No| P8[Slow Upload / Download]

F --> P9[Data Protection and Recovery Issues]

```

Quick symptom matrix

Symptom Pattern Likely Category Playbook
Endpoint unreachable, timeout, DNS mismatch Access Cannot Access Storage Account
Private endpoint exists but client still resolves public IP Access Private Endpoint and DNS Issues
SMB/NFS mount fails from client Access File Share Mount Issues
403 with Azure AD, shared key, or role confusion Security Authorization Failures
SAS rejected, expired, not-yet-valid, wrong scope Security SAS and Token Issues
429/503 and storage metrics show pressure Performance Throttling and Performance Issues
Transfer is slow but account not throttling Performance Slow Upload / Download
Blob/file was deleted or overwritten Recovery Data Protection and Recovery Issues

Branch prompts to ask in order

  1. Is the request reaching the intended endpoint path at all?
  2. Is the dominant error about connectivity, authorization, or throughput?
  3. Is the failure isolated to private endpoint, file share protocol, or SAS/token flow?
  4. Do metrics show throttling, or is the bottleneck outside the storage account?
  5. If data is missing, which protection features were enabled before the incident?

Minimal evidence before picking a branch

  • Exact error code and timestamp.
  • Storage service involved: Blob, Files, Queue, or Table.
  • DNS answer and public/private endpoint expectation.
  • Auth method in use: Azure AD, SAS, or shared key.
  • Metrics snapshot for latency, success rate, and transaction volume.

See Also

Sources