Correlation Query Library¶

Use these multi-log-group CloudWatch Logs Insights queries to align platform activity with request symptoms so you can test whether deployments, scaling events, or health changes coincide with user-visible failures.

When to Use¶

Use when you suspect a deployment caused an error spike but need time-aligned evidence to confirm the hypothesis.
Use when latency increases during scaling events and you need to determine whether new instances are the cause.
Use during post-incident review to build a unified timeline across platform and application log streams.
Use to disprove coincidental correlation — if deployment timing does not match the error window, look elsewhere.

Prerequisites¶

CloudWatch Logs agent enabled on the Elastic Beanstalk environment (aws:elasticbeanstalk:cloudwatch:logs namespace).
Both HTTP access logs and platform activity logs streaming to CloudWatch.
IAM permissions for logs:StartQuery and logs:GetQueryResults across multiple log groups.
Familiarity with CloudWatch Logs Insights multi-log-group query syntax (selecting multiple log groups in the console or API).

Log Group Patterns¶

/aws/elasticbeanstalk/$ENV_NAME/var/log/nginx/access.log
/aws/elasticbeanstalk/$ENV_NAME/var/log/eb-activity.log

Correlation queries join time-bucketed data from at least two log groups. Select both log groups in the CloudWatch Logs Insights console before running.

flowchart LR
    A[HTTP access logs] --> C[Correlation Query Library]
    B[Platform activity logs] --> C
    C --> D[Deploy vs Errors]
    C --> E[Scaling vs Latency]

Interpretation Guidance¶

Deploy timestamps that precede the error spike by 1–5 minutes strongly suggest the deployment introduced the regression. Check the Deployment Events query for the specific failing hook or version.
Error spikes with no deployment activity in the same window rule out deployment as a cause and point toward traffic surges, dependency failures, or resource exhaustion.
Latency increases that begin exactly when new instances join suggest cold-start overhead or incomplete warm-up. Monitor until latency stabilizes after the scaling event completes.
Latency increases that precede scaling activity indicate that scaling was reactive, not causal — the root cause is the load itself.

Queries¶

Query	Description	Link
Deploy vs Errors	Align deployment activity with HTTP 5xx counts in the same time buckets	deploy-vs-errors.md
Scaling vs Latency	Compare scaling-related platform activity against p95 request latency	scaling-vs-latency.md

Sources¶

https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/AWSHowTo.cloudwatchlogs.html
https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/CWL_QuerySyntax.html
https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/CWL_QuerySyntax-examples.html