
Evidence Map

Use this page to decide which commands and signals matter for each AKS symptom category. Collecting evidence before acting keeps you from making corrective changes at random.

Main Content

flowchart TD
    A[Symptom Category] --> B[kubectl state]
    A --> C[Events]
    A --> D[Node signals]
    A --> E[Azure resource state]
    A --> F[Logs and metrics]

Symptom	First Evidence	Follow-up Evidence
Pod not starting	`kubectl get pods`, `kubectl describe pod`	container logs, image pull secret, node conditions
Service unreachable	`kubectl get svc`, `kubectl get endpoints`	selector labels, NetworkPolicy, DNS lookup
Ingress broken	`kubectl get ingress`, controller logs	load balancer state, backend endpoints, cert/TLS
Node unhealthy	`kubectl get nodes`, `kubectl describe node`	daemonsets, CNI state, quota, subnet IPs
Upgrade issue	cluster version, events	controller compatibility, PDBs, node image rollout
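
As a minimal sketch, the First Evidence column for the first four rows could be wrapped in one shell function. The function name `first_evidence` and its arguments (category, object name, namespace) are hypothetical and not part of this page; the `kubectl` subcommands are the ones listed in the table. The upgrade row is left out because its evidence is not tied to a single named object.

```shell
# Hypothetical helper: run the "First Evidence" commands for one symptom category.
# Usage: first_evidence <pod|service|ingress|node> <name> [namespace]
first_evidence() {
  local category="$1" name="$2" ns="${3:-default}"
  case "$category" in
    pod)
      kubectl get pod "$name" -n "$ns" -o wide
      kubectl describe pod "$name" -n "$ns"
      ;;
    service)
      kubectl get svc "$name" -n "$ns" -o wide
      kubectl get endpoints "$name" -n "$ns"
      ;;
    ingress)
      # Controller logs are not collected here: the controller's namespace
      # and name depend on which ingress controller is installed.
      kubectl get ingress "$name" -n "$ns"
      ;;
    node)
      # Nodes are cluster-scoped, so the namespace argument is ignored.
      kubectl get node "$name" -o wide
      kubectl describe node "$name"
      ;;
    *)
      echo "unknown category: $category" >&2
      return 2
      ;;
  esac
}
```

For example, `first_evidence service web default` prints the Service and its Endpoints. An Endpoints object with no addresses often points to a selector mismatch or to pods that are not Ready, which is where the follow-up column picks up.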

High-value command groups

kubectl get events -A --sort-by=.lastTimestamp
kubectl get pods -A -o wide
kubectl get nodes -o wide
az aks show --resource-group "$RG" --name "$CLUSTER_NAME" --output json
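
The four commands above can also be captured together so the evidence survives later changes to the cluster. The following is a sketch only: the function name `collect_snapshot`, the output file names, and the default directory name are assumptions, and it expects `kubectl` and `az` to be authenticated already, with `RG` and `CLUSTER_NAME` set as in the commands above.

```shell
# Hypothetical helper: save the output of the four commands into one directory.
# Usage: collect_snapshot [output-dir]
collect_snapshot() {
  # Default to a UTC-timestamped directory so repeated runs do not overwrite each other.
  local out_dir="${1:-aks-evidence-$(date -u +%Y%m%dT%H%M%SZ)}"

  if [ -z "${RG:-}" ] || [ -z "${CLUSTER_NAME:-}" ]; then
    echo "RG and CLUSTER_NAME must be set" >&2
    return 1
  fi

  mkdir -p "$out_dir" || return 1

  # Errors go into the same files, so a failed command still leaves a record.
  kubectl get events -A --sort-by=.lastTimestamp > "$out_dir/events.txt" 2>&1
  kubectl get pods -A -o wide > "$out_dir/pods.txt" 2>&1
  kubectl get nodes -o wide > "$out_dir/nodes.txt" 2>&1

  # Keep stderr separate here so aks.json stays parseable JSON.
  az aks show --resource-group "$RG" --name "$CLUSTER_NAME" --output json \
    > "$out_dir/aks.json" 2> "$out_dir/aks.err"

  # Print the directory so callers can capture it.
  echo "$out_dir"
}
```

Running `collect_snapshot` before and after a corrective change gives two directories that can be compared with `diff -r`.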
