Skip to content

Decision Tree

Use this decision tree when time matters more than completeness. Its job is to route you to the most likely evidence path, not to prove root cause by itself.

Main Content

flowchart TD
    A[Symptom] --> B{{Pods healthy?}}
    B -->|No| C[Pod Issues]
    B -->|Yes| D{{Service reachable inside cluster?}}
    D -->|No| E[Connectivity: Service Unreachable]
    D -->|Yes| F{{Ingress reachable externally?}}
    F -->|No| G[Connectivity: Ingress Failure]
    C --> H{{Pending or Image Pull or CrashLoop?}}
    H --> I[Pod Playbooks]
    A --> J{{Nodes Ready?}}
    J -->|No| K[Node Playbooks]
    A --> L{{Change just happened?}}
    L -->|Upgrade| M[Upgrade Failure]
    L -->|Scaling| N[Scaling Failure]

How to Use

  1. Classify the symptom in under one minute.
  2. Pick the playbook that matches the first broken layer.
  3. Gather evidence before changing configuration.
  4. Re-route if the evidence disproves your first hypothesis.

See Also

Sources