Request Lifecycle

Every request to Azure App Service travels through multiple platform layers before reaching your application process. Understanding this lifecycle is essential for troubleshooting latency, timeout behavior, routing issues, and scale-related anomalies.

Prerequisites

Familiarity with HTTP(S), DNS, and reverse proxies
Basic understanding of load balancing and health probes
Access to App Service logs and metrics

Main Content

End-to-end request path

sequenceDiagram
    participant Client as Client
    participant FE as App Service Frontend
    participant Worker as Worker Instance Reverse Proxy
    participant App as Application Process

    Client->>FE: HTTPS request (443)
    FE->>Worker: Forward to selected instance
    Worker->>App: Proxy to local application port
    App-->>Worker: Response
    Worker-->>FE: Response
    FE-->>Client: HTTP response

Stage 1: DNS and global entry

Requests begin with DNS resolution of the app hostname. App Service supports platform hostnames and custom domains. After DNS resolution:

TLS handshake occurs
SNI and host header route to the correct app
Global edge and frontend infrastructure direct traffic to the right stamp/region
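
The hostname-based routing step above can be sketched as a lookup from the request's host to an app. This is a minimal illustration, not the platform's implementation: the `BINDINGS` table, its hostnames, and the function names are all hypothetical.

```python
from typing import Dict, Optional

# Hypothetical hostname bindings (hostname -> app name); values are illustrative.
BINDINGS: Dict[str, str] = {
    "contoso-app.azurewebsites.net": "contoso-app",
    "www.contoso.example": "contoso-app",
    "api.contoso.example": "contoso-api",
}


def normalize_host(host_header: str) -> str:
    """Lower-case the host, then drop any :port suffix and trailing dot."""
    host = host_header.strip().lower()
    host = host.split(":", 1)[0]  # IPv6 literals are out of scope for this sketch
    return host.rstrip(".")


def resolve_app(host_header: str, bindings: Dict[str, str] = BINDINGS) -> Optional[str]:
    """Return the app bound to this hostname, or None when nothing matches."""
    return bindings.get(normalize_host(host_header))
```

A platform hostname and a custom domain can map to the same app, which is why both `contoso-app.azurewebsites.net` and `www.contoso.example` resolve to `contoso-app` in this sketch.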

Stage 2: Frontend routing

Frontend components perform:

TLS termination
Hostname validation
Access restriction evaluation
Route selection to a healthy worker instance

If no healthy backend is available, requests can fail at the frontend before your app code executes.
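
To make the "fail before your code runs" case concrete, here is a small sketch of route selection over a pool of instances. The round-robin policy, the `Instance` and `Frontend` names, and the 503 status are assumptions made for illustration; the platform's actual selection logic is not described in this document.

```python
import itertools
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Instance:
    name: str
    healthy: bool = True


class Frontend:
    """Rotates over healthy instances (rotation policy is an assumption)."""

    def __init__(self, instances: List[Instance]) -> None:
        self.instances = instances
        self._counter = itertools.count()

    def pick(self) -> Optional[Instance]:
        healthy = [i for i in self.instances if i.healthy]
        if not healthy:
            return None
        return healthy[next(self._counter) % len(healthy)]

    def handle(self, path: str) -> Tuple[int, str]:
        instance = self.pick()
        if instance is None:
            # Rejected at the frontend: application code never executes.
            return 503, "no healthy backend"
        return 200, f"{instance.name} handled {path}"
```

When every instance is marked unhealthy, `handle` returns without touching any instance, which is why such failures leave no trace in application logs.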

Stage 3: Worker reverse proxy handoff

On the worker, a local reverse proxy passes traffic to your application process listening on the platform-assigned port.

Port contract

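Your application must bind to the port provided by the platform environment. If it binds to a fixed local port instead, the process can start successfully while requests still fail, because the reverse proxy forwards traffic to a port nothing is listening on.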
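
A minimal sketch of honoring the port contract with Python's standard library follows. It assumes the platform exposes the port through an environment variable named `PORT`; the exact variable depends on the hosting stack, so check the guide for your runtime. `DEFAULT_PORT`, `Hello`, and `make_server` are illustrative names.

```python
import os
from http.server import BaseHTTPRequestHandler, HTTPServer

DEFAULT_PORT = 8000  # fallback for local development only (assumption)


def resolve_port(env=os.environ) -> int:
    """Read the listening port from the environment instead of hard-coding it."""
    return int(env.get("PORT", DEFAULT_PORT))


class Hello(BaseHTTPRequestHandler):
    def do_GET(self):
        body = b"ok\n"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def make_server(env=os.environ) -> HTTPServer:
    # Bind to all interfaces so the local reverse proxy can reach the process.
    return HTTPServer(("0.0.0.0", resolve_port(env)), Hello)

# To run the server: make_server().serve_forever()
```

Keeping the port lookup in one function makes the contract easy to test locally by passing a dictionary in place of the real environment.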

Stage 4: Application execution

Your app handles routing, business logic, and dependency calls, then returns the response status, headers, and body.

Performance at this stage depends on:

Application CPU and memory consumption
Dependency latency (database, API, cache)
Thread/process/event-loop saturation characteristics
Connection pooling and outbound networking configuration

Response return path

Responses travel back through worker and frontend layers to the client. Response headers may be modified by platform policies such as compression, security headers, and reverse-proxy metadata injection.

Timeout and connection behaviors

Platform-level timeouts apply regardless of how your application is written, so design requests to finish well within them.

Behavior	Typical Impact
Frontend request timeout	Long-running requests may return gateway timeout
Idle connection timeout	Idle sockets can be closed by infrastructure
Slow dependency path	Queue buildup and elevated tail latency

Design guidance:

Keep interactive requests short
Offload long work to background pipelines
Return 202 Accepted for asynchronous workflows

Warning

Holding HTTP requests open for background processing increases timeout risk and reduces available concurrency.
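
The design guidance above can be sketched as an accept-then-poll pattern: the request handler starts the work, returns 202 immediately, and the client checks a status location later. This sketch uses an in-process thread and an in-memory `JOBS` dictionary purely for illustration; a real deployment would more likely hand work to a durable queue, since in-process work is lost when an instance recycles. All names and the `/jobs/<id>` path are hypothetical.

```python
import threading
import uuid
from typing import Callable, Dict, Tuple

JOBS: Dict[str, dict] = {}  # in-memory store, for illustration only
_LOCK = threading.Lock()


def submit(work: Callable[[], object]) -> Tuple[int, dict, threading.Thread]:
    """Start work in the background and return 202 with a status location."""
    job_id = uuid.uuid4().hex
    with _LOCK:
        JOBS[job_id] = {"state": "running", "result": None}

    def run() -> None:
        result = work()
        with _LOCK:
            JOBS[job_id] = {"state": "done", "result": result}

    worker = threading.Thread(target=run, daemon=True)
    worker.start()
    # The thread is returned only so callers (and tests) can wait on it.
    return 202, {"Location": f"/jobs/{job_id}"}, worker


def status(job_id: str) -> Tuple[int, dict]:
    """Return 200 with the job record, or 404 for an unknown job."""
    with _LOCK:
        job = JOBS.get(job_id)
    if job is None:
        return 404, {}
    return 200, dict(job)
```

The HTTP request completes as soon as `submit` returns, so the connection is not held open while the work runs.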

Instance selection and session affinity

By default, frontend routing distributes traffic across healthy instances. Optional session affinity can pin a client to a specific instance using cookies.

graph TD
    FE[Frontend] --> I1[Instance 1]
    FE --> I2[Instance 2]
    FE --> I3[Instance 3]
    ClientA[Client A] -.Affinity Cookie.-> I2

Affinity trade-offs:

Can simplify legacy in-memory session use
Can produce uneven load distribution
Reduces resilience if an instance fails

Preferred pattern: externalize session/state to a shared store.
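
The resilience trade-off is easier to see in code. The sketch below pins a client to an instance through a cookie and re-pins it when that instance leaves the healthy pool, at which point any in-memory session on the old instance is gone. The cookie name, the `route` function, and the use of a caller-supplied counter for selection are illustrative assumptions.

```python
from typing import Dict, List, Tuple

AFFINITY_COOKIE = "affinity"  # hypothetical cookie name for this sketch


def route(cookies: Dict[str, str], healthy: List[str], counter: int) -> Tuple[str, Dict[str, str]]:
    """Return (chosen instance, cookies to set on the response)."""
    if not healthy:
        raise RuntimeError("no healthy instances")
    pinned = cookies.get(AFFINITY_COOKIE)
    if pinned in healthy:
        return pinned, {}
    # No cookie yet, or the pinned instance is gone: choose one and re-pin.
    chosen = healthy[counter % len(healthy)]
    return chosen, {AFFINITY_COOKIE: chosen}
```

With state held in a shared store instead, the re-pin step costs nothing, because any instance can serve the next request.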

Health checks and request eligibility

Health checks influence whether an instance receives traffic.

Healthy instance: included in routing pool
Unhealthy instance: removed from routing pool
Recovering instance: reintroduced after passing probes

Probe design should be lightweight and representative of app readiness.
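
The three states above can be modeled as a small tracker that counts consecutive probe results. The thresholds here (three failures to remove, two successes to reintroduce) and the rule that only 2xx responses count as healthy are assumptions chosen for the sketch, not documented platform values.

```python
from dataclasses import dataclass


@dataclass
class ProbeTracker:
    unhealthy_after: int = 3  # consecutive failures before removal (assumed)
    healthy_after: int = 2    # consecutive successes before reintroduction (assumed)
    in_pool: bool = True
    _fails: int = 0
    _passes: int = 0

    def record(self, status_code: int) -> bool:
        """Record one probe result and return whether the instance is routable."""
        if 200 <= status_code < 300:
            self._passes += 1
            self._fails = 0
            if not self.in_pool and self._passes >= self.healthy_after:
                self.in_pool = True
        else:
            self._fails += 1
            self._passes = 0
            if self.in_pool and self._fails >= self.unhealthy_after:
                self.in_pool = False
        return self.in_pool
```

Requiring several consecutive results in each direction keeps a single slow probe from flapping an instance in and out of the pool.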

Deployment slots and lifecycle impact

With deployment slots:

New version warms in non-production slot
Health checks validate startup
Slot swap redirects production hostname

This reduces user-facing cold starts and failed startup exposure.

Observability along the lifecycle

Correlate these signals for end-to-end insight:

Request logs and status code distributions
Frontend-generated diagnostics
Instance restart events
Dependency timing and failure rates
Application-level correlation IDs
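
For the last signal in that list, a common approach is to reuse a correlation ID supplied by the caller, or mint one, and attach it to both the logs and the response. The header name `X-Correlation-ID` and the function below are hypothetical; use whatever convention your services already share.

```python
import logging
import uuid
from typing import Dict, Tuple

HEADER = "X-Correlation-ID"  # hypothetical header name

log = logging.getLogger("lifecycle")


def with_correlation(headers: Dict[str, str]) -> Tuple[str, Dict[str, str]]:
    """Reuse the caller's correlation ID or create one, and echo it back."""
    lowered = {k.lower(): v for k, v in headers.items()}  # header names are case-insensitive
    cid = lowered.get(HEADER.lower()) or uuid.uuid4().hex
    # Attach the ID to the log record so it can be joined with other signals.
    log.info("request received", extra={"correlation_id": cid})
    return cid, {HEADER: cid}
```

Passing the same ID on outbound dependency calls lets you line up application logs with dependency timing for a single request.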

Portal view: Log stream

Log stream is the fastest portal surface for watching requests move through the runtime path in near real time. Use the Runtime selector, the per-instance picker, and the Last 30 minutes lookback to isolate which worker handled a call. The console lines expose request metadata, response status, and exporter activity such as Azure Monitor telemetry delivery, so you can correlate application logs, instance affinity, and downstream instrumentation without waiting for slower aggregated reports.

CLI examples for lifecycle inspection

Enable application, web server, and failed-request logging:

az webapp log config \
    --resource-group "$RG" \
    --name "$APP_NAME" \
    --application-logging filesystem \
    --detailed-error-messages true \
    --failed-request-tracing true \
    --web-server-logging filesystem

Stream logs in real time:

az webapp log tail \
    --resource-group "$RG" \
    --name "$APP_NAME"

Inspect access restriction settings that affect frontend admission:

az webapp config access-restriction show \
    --resource-group "$RG" \
    --name "$APP_NAME" \
    --output json

Example output snippet (PII masked):

{
  "ipSecurityRestrictions": [
    {
      "action": "Allow",
      "description": null,
      "headers": null,
      "ipAddress": "203.0.113.0/24",
      "name": "corp-office",
      "priority": 100,
      "subnetMask": null,
      "subnetTrafficTag": null,
      "tag": "Default",
      "vnetSubnetResourceId": null,
      "vnetTrafficTag": null
    }
  ],
  "scmIpSecurityRestrictionsUseMain": false
}
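
To reason about how output like this affects frontend admission, the sketch below evaluates a client address against the rules in priority order. It assumes that the lowest priority number is checked first, that the first matching rule decides, and that a non-empty rule list with no match results in a deny; confirm these semantics against the platform documentation before relying on them. `SAMPLE` and `is_allowed` are illustrative names.

```python
import ipaddress
import json

# Trimmed copy of the example output above.
SAMPLE = json.loads("""
{
  "ipSecurityRestrictions": [
    {"action": "Allow", "ipAddress": "203.0.113.0/24", "name": "corp-office", "priority": 100}
  ]
}
""")


def is_allowed(client_ip: str, config: dict) -> bool:
    """Evaluate rules in ascending priority; the first match decides."""
    rules = sorted(config.get("ipSecurityRestrictions", []), key=lambda r: r["priority"])
    if not rules:
        return True  # assumed: no rules means no restriction
    addr = ipaddress.ip_address(client_ip)
    for rule in rules:
        cidr = rule.get("ipAddress")
        if not cidr or cidr == "Any":
            continue  # non-CIDR rule types are out of scope for this sketch
        if addr in ipaddress.ip_network(cidr, strict=False):
            return rule["action"] == "Allow"
    return False  # assumed: implicit deny when rules exist but none match
```

Under these assumptions, a client at `203.0.113.42` is admitted and one at `198.51.100.7` is rejected before the request reaches a worker.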

Common lifecycle failure patterns

Symptom	Likely Layer	First Checks
403 before app logs	Frontend restrictions	Access restrictions, auth settings
502/503 bursts	Worker/app startup	Restarts, health probe failures
504 responses	Long request path	Dependency latency, request design
Intermittent timeout	Outbound saturation	SNAT, connection pool settings

Advanced Topics

Request queueing and tail latency

Median latency can look healthy while p95/p99 degrades under load. Track queue indicators and tail percentiles, not only averages.
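
A short worked example shows how this happens. The latencies below are invented: 94 requests complete in 40 ms and 6 queue behind a slow dependency for 900 ms. The `percentile` helper uses the nearest-rank definition, which is one of several common conventions.

```python
import math
from typing import List


def percentile(samples: List[float], pct: int) -> float:
    """Nearest-rank percentile: smallest value with at least pct% of samples at or below it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


# Hypothetical latencies in milliseconds.
latencies = [40.0] * 94 + [900.0] * 6

summary = {p: percentile(latencies, p) for p in (50, 95, 99)}
mean = sum(latencies) / len(latencies)
```

Here the median stays at 40 ms and the mean is 91.6 ms, while p95 and p99 are both 900 ms, so a dashboard that shows only the median or the mean hides the requests users complain about.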

WebSockets and long-lived connections

If your app uses long-lived connections, confirm that the platform is configured to support them, and check idle timeout behavior and how connections distribute across instances after scale-out.

Graceful shutdown during recycles

When instances recycle, in-flight requests should complete quickly, and background workers should checkpoint progress externally.
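
One way to implement this in a background worker is to watch for a stop signal between units of work and checkpoint after each one. The sketch assumes the process receives `SIGTERM` before it is stopped, which depends on the hosting stack, and it uses an in-memory `checkpoints` list as a stand-in for an external store. All names are illustrative.

```python
import signal
import threading

stop_requested = threading.Event()
checkpoints = []  # stand-in for an external store (assumption)


def request_stop(signum=None, frame=None) -> None:
    """Signal handler: ask workers to stop after the current item."""
    stop_requested.set()


def install_handlers() -> None:
    # Call once from the main thread at startup.
    signal.signal(signal.SIGTERM, request_stop)


def run_worker(items, save_checkpoint=checkpoints.append) -> int:
    """Process items until finished or a stop is requested; return the count processed."""
    done = 0
    for item in items:
        if stop_requested.is_set():
            break
        # ... do the real work for `item` here ...
        done += 1
        save_checkpoint(done)  # persist progress so another instance can resume
    return done
```

Checking the flag between items keeps each unit of work atomic, so a replacement instance can resume from the last saved checkpoint instead of starting over.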

Incident triage workflow

Confirm frontend admission (DNS/TLS/restrictions)
Confirm worker health and restart events
Confirm app process readiness
Confirm dependency latency and outbound connectivity

Language-Specific Details

For language-specific implementation details, see:

Node.js Guide
Python Guide
Java Guide
.NET Guide