Skip to content

Detector Map: Symptom to Playbook

Use this detector map to move from first symptom to the most likely troubleshooting playbook with minimal guesswork.

Observed Baseline Lifecycle Signals (Real Deployment)

Reason_s Type_s Typical count Interpretation during triage
ProbeFailed Warning 74 Often expected during cold start if revision later becomes ready
RevisionUpdate Normal 14 Revision template/config updated
ContainerAppUpdate Normal 9 App-level config update accepted
RevisionReady Normal 7 Revision reached healthy ready state
ContainerAppReady Normal 6 App reached running state
KEDAScalersStarted Normal 6 Scalers activated for revision
RevisionDeactivating Normal 5 Prior revision being drained/deactivated
ContainerStarted Normal 3 Container process started
PulledImage Normal 3 Image pull completed successfully
ContainerCreated Normal 3 Container object created
AssigningReplica Normal 3 Replica scheduled to node
PullingImage Normal 2 Image pull started
ContainerTerminated Warning 2 Container exited; check exit code/context

Symptom Routing Flow

flowchart TD
    A[Start: user-visible symptom] --> B{Revision created and healthy?}
    B -->|No| C{System logs show pull or auth?}
    C -->|Yes| P1[Image Pull Failure]
    C -->|No| C2{Probe or startup timeout?}
    C2 -->|Yes| P4[Probe Failure and Slow Start]
    C2 -->|No| P2[Revision Provisioning Failure]

    B -->|Yes| D{Replica stable and running?}
    D -->|No| D1{OOM or restart signals?}
    D1 -->|Yes| P11[CrashLoop OOM and Resource Pressure]
    D1 -->|No| P3[Container Start Failure]

    D -->|Yes| E{Ingress or client path failing?}
    E -->|Yes| E1{Public or internal endpoint issue?}
    E1 -->|Public| P5[Ingress Not Reachable]
    E1 -->|Internal DNS| P6[Internal DNS and Private Endpoint Failure]
    E1 -->|Service-to-service| P7[Service-to-Service Connectivity Failure]

    E -->|No| F{Scaling behavior unexpected?}
    F -->|HTTP load not scaling| P8[HTTP Scaling Not Triggering]
    F -->|Event backlog not scaling| P9[Event Scaler Mismatch]
    F -->|No| G{Identity or secrets failing?}
    G -->|Identity token or RBAC| P10[Managed Identity Auth Failure]
    G -->|Secret or Key Vault reference| P12[Secret and Key Vault Reference Failure]
    G -->|No| H{Platform feature path}
    H -->|Dapr sidecar or component| P13[Dapr Sidecar or Component Failure]
    H -->|Container App job| P14[Container App Job Execution Failure]
    H -->|Post-deploy regression| P15[Bad Revision Rollout and Rollback]

Error String Mapping

Error string or pattern Recommended playbook
ImagePullBackOff, manifest unknown, unauthorized, denied Image Pull Failure
Provisioning failed, secretRef not found, template validation errors Revision Provisioning Failure
Traceback, Address already in use, startup command exit Container Start Failure
Readiness probe failed, liveness probe failed, startup timeout Probe Failure and Slow Start
502/504 from app FQDN, upstream connect error Ingress Not Reachable
Temporary failure in name resolution, NXDOMAIN Internal DNS and Private Endpoint Failure
connection refused, TLS handshake timeout between services Service-to-Service Connectivity Failure
HTTP latency spike with flat replica count HTTP Scaling Not Triggering
Queue backlog grows, no scale-out, KEDA trigger errors Event Scaler Mismatch
CrashLoopBackOff, OOMKilled, frequent replica restarts CrashLoop OOM and Resource Pressure
ManagedIdentityCredential errors, 401/403 to Azure resource Managed Identity Auth Failure
Secret reference errors, Key Vault access denied Secret and Key Vault Reference Failure
Dapr invocation/state/pubsub failures Dapr Sidecar or Component Failure
Job Failed / TimedOut / retry storm Container App Job Execution Failure
Errors spike immediately after traffic shift Bad Revision Rollout and Rollback

Playbook Catalog

Startup and Provisioning

Ingress and Networking

Scaling and Runtime

Identity and Configuration

Platform Features

See Also