Skip to content

Evidence Map

Use this page to decide which commands and signals matter for each AKS symptom category. Better evidence collection prevents random corrective changes.

Main Content

flowchart TD
    A[Symptom Category] --> B[kubectl state]
    A --> C[Events]
    A --> D[Node signals]
    A --> E[Azure resource state]
    A --> F[Logs and metrics]
Symptom First Evidence Follow-up Evidence
Pod not starting kubectl get pods, kubectl describe pod container logs, image pull secret, node conditions
Service unreachable kubectl get svc, kubectl get endpoints selector labels, NetworkPolicy, DNS lookup
Ingress broken kubectl get ingress, controller logs load balancer state, backend endpoints, cert/TLS
Node unhealthy kubectl get nodes, kubectl describe node daemonsets, CNI state, quota, subnet IPs
Upgrade issue cluster version, events controller compatibility, PDBs, node image rollout

High-value command groups

kubectl get events -A --sort-by=.lastTimestamp
kubectl get pods -A -o wide
kubectl get nodes -o wide
az aks show --resource-group $RG --name $CLUSTER_NAME --output json

See Also

Sources