
A load balancer is the front door to your infrastructure. All user traffic passes through it before reaching your application servers. If the load balancer fails — or if it's routing traffic to unhealthy backends — users can't reach your service. Monitoring the load balancer is as critical as monitoring the application behind it.
Load balancers distribute incoming traffic across multiple backend servers. They also:
Failure modes:
Monitor the load balancer endpoint (your public domain or IP) externally:
Monitor: https://yourdomain.com
Expected status: 200
Interval: 1 minute
This validates the complete path: DNS → load balancer → backend pool → application. If any component fails, this check catches it.
For multi-location monitoring, check from multiple geographic regions — CDN/anycast issues can affect availability regionally.
Configure health checks on your load balancer to detect backend failures:
{
"HealthCheckProtocol": "HTTP",
"HealthCheckPath": "/health",
"HealthCheckPort": "traffic-port",
"HealthCheckIntervalSeconds": 30,
"HealthCheckTimeoutSeconds": 5,
"HealthyThresholdCount": 2,
"UnhealthyThresholdCount": 3
}
When all backends fail health checks, the ALB returns 502 Bad Gateway to all requests — detectable by your external monitor.
upstream backend {
server app1:8080;
server app2:8080;
# Active health checks (Nginx Plus only)
# For open-source: use passive health checking
}
For open-source Nginx, health checking is passive — failed requests are detected as they occur. Nginx Plus supports active health checks.
backend app_servers
option httpchk GET /health
http-check expect status 200
server app1 192.168.1.10:8080 check inter 10s fall 3 rise 2
server app2 192.168.1.11:8080 check inter 10s fall 3 rise 2
HAProxy's active health checking removes backends from rotation on failure and adds them back when they recover.
If your load balancer terminates SSL (common with ALB, Nginx, HAProxy), monitor the certificate:
Monitor type: SSL certificate
Domain: yourdomain.com (points to load balancer)
Alert at: 30 days remaining
An expired certificate at the load balancer prevents all traffic from reaching your backends, regardless of backend health.
See SSL certificate monitoring for complete setup.
For AWS ALBs, configure CloudWatch alarms:
# Alert when all backends are unhealthy
aws cloudwatch put-metric-alarm \
--alarm-name "ALB-HealthyHostCount" \
--metric-name HealthyHostCount \
--namespace AWS/ApplicationELB \
--dimensions Name=LoadBalancer,Value=app/your-alb/xxxx \
--period 60 \
--evaluation-periods 2 \
--threshold 1 \
--comparison-operator LessThanThreshold \
--alarm-actions arn:aws:sns:...
Key ALB metrics to monitor:
HealthyHostCount — alert if count drops to 0UnhealthyHostCount — alert if count rises above 0HTTPCode_ELB_5XX_Count — alert on 5xx errors from the ALB itselfTargetResponseTime — alert on high latencyLoad balancers add minimal latency in healthy conditions. If you see response time increasing in your uptime monitoring data, it may indicate:
Response time trends visible in your monitoring tool's historical data help diagnose these patterns.
| Check | Tool | What It Detects |
|---|---|---|
| External HTTP monitor | Domain Monitor | User-facing availability |
| SSL certificate monitor | Domain Monitor | Certificate expiry |
| HealthyHostCount alarm | CloudWatch | All backends failing |
| ALB 5xx alarm | CloudWatch | Load balancer errors |
| Response time monitor | Domain Monitor | Performance degradation |
The external HTTP monitor from Domain Monitor is your user-perspective check — the others provide internal visibility. Both layers are necessary for complete coverage.
Monitor your load balancer and application stack from outside your infrastructure at Domain Monitor.
Generative AI creates new content — text, images, code, and more. This guide explains how it works, what tools are available, and where it's genuinely useful versus overhyped.
Read moreCursor AI is an AI-powered code editor built on VS Code. Learn what it does, how it works, and whether it's the right tool for your development workflow.
Read moreClaude Opus is Anthropic's most capable AI model, built for complex reasoning and demanding tasks. Learn what it does, how it compares, and when to use it.
Read moreLooking to monitor your website and domains? Join our platform and start today.