Website monitoring checklist showing all required monitoring checks for production web applications
# guides

Website Monitoring Checklist for Developers

Before you launch a production application — and periodically thereafter — run through this monitoring checklist. It covers every layer of monitoring a production web application needs, from basic uptime checks to SSL certificates and background job monitoring.

Before Launch: Minimum Viable Monitoring

At bare minimum, set these up before any production launch:

  • HTTP uptime monitor on your main domain — 1-minute check interval
  • SSL certificate monitor — alerts at 30 days before expiry
  • Domain expiry monitor — alerts at 60 days before expiry
  • SMS alert configured for downtime (not just email)
  • Recovery alert enabled so you know when issues resolve

This takes about 10 minutes to configure and protects you from the most common failure modes. See how to set up uptime monitoring.


Uptime Monitoring

HTTP Checks

  • Main domain monitored (https://yourdomain.com)
  • Critical application endpoints monitored (not just homepage)
  • API health endpoint monitored (/health or /api/health)
  • Content verification enabled (not just status code check)
  • Check interval set appropriately (1 min for production, 5 min for staging)
  • Multi-location checks enabled (at least 3 locations)
  • Confirmation count set to 2 (avoids false alarms from transient blips)

For APIs

  • Health endpoint returns 200 with useful JSON ({"status": "ok"})
  • Authentication endpoints monitored
  • Critical API routes monitored
  • Response time threshold configured

SSL Certificate Monitoring

  • SSL certificate monitored for each domain (including subdomains with separate certs)
  • Alert at 30 days remaining (email/Slack)
  • Alert at 14 days remaining (escalated)
  • Alert at 7 days remaining (urgent)
  • Auto-renewal verified working (Let's Encrypt, cert-manager, etc.)
  • Wildcard certificates accounted for

See the full guide: SSL certificate monitoring


Domain Registration Monitoring

  • Domain expiry monitored for all registered domains
  • Alert at 60 days remaining
  • Alert at 30 days remaining
  • Alert at 14 days remaining
  • Auto-renewal enabled at registrar
  • Domain lock (clientTransferProhibited) enabled
  • Registrant contact information is current

See the full guide: domain expiry monitoring


Alert Configuration

  • SMS alerts for P1/P2 downtime
  • Slack alerts for team visibility
  • Email alerts for SSL/domain warnings
  • Recovery alerts enabled
  • Alert routing verified (correct people receiving alerts)
  • Alerts tested (deliberately triggered and confirmed received)
  • Maintenance windows configured for scheduled deployments
  • On-call rotation documented

See the full guide: how to set up downtime alerts


Background Jobs and Scheduled Tasks

For any cron jobs or background workers:

  • Heartbeat monitor configured for each critical cron job
  • Heartbeat interval matches cron schedule
  • Grace period accounts for expected execution time
  • Ping URL added to job script (on successful completion)
  • Workers with HTTP ports have external HTTP monitors

See the full guide: how to monitor cron jobs


Application Health Endpoints

Verify your application has proper health endpoints:

  • /health endpoint exists and returns 200
  • Liveness check: process alive (fast, lightweight)
  • Readiness check: dependencies available (database, cache, external APIs)
  • Health endpoints accessible to monitoring service
  • Health endpoints not accessible publicly (or rate limited)

For Kubernetes: monitoring Kubernetes pods


Platform-Specific Checks

For hosted platforms:


Incident Response Readiness

  • Incident response runbook documented and accessible
  • On-call rotation defined
  • Public status page set up
  • Status page subscriber notifications configured
  • Escalation procedure documented
  • Post-mortem process defined

See related guides:


Periodic Review (Quarterly)

  • Verify all monitors are still pointing at correct URLs
  • Test alert delivery (deliberately trigger a monitor failure)
  • Review false positive rate — adjust confirmation counts if needed
  • Update on-call contact information
  • Review incident history — any recurring patterns?
  • Check new services added since last review
  • Confirm monitoring coverage matches current architecture

Documentation

  • Monitoring setup documented in team wiki
  • Monitor list and their purpose documented
  • Alert routing documented
  • Maintenance window procedure documented

This checklist covers the monitoring needs of most production web applications. Bookmark it, run through it at launch, and revisit quarterly.


Check off this list using Domain Monitor — uptime, SSL, domain, and heartbeat monitoring in one place.

More posts

What Is a Subdomain Takeover and How to Prevent It

A subdomain takeover lets an attacker claim your subdomain by exploiting dangling DNS records. Learn how it happens, real-world examples, and how DNS monitoring detects it.

Read more
What Is Mean Time to Detect (MTTD)?

Mean time to detect (MTTD) measures how long it takes to discover an incident after it starts. Reducing MTTD is one of the highest-leverage improvements in reliability engineering.

Read more
What Is Black Box Monitoring?

Black box monitoring tests your systems from the outside, the way users experience them — without access to internal code or infrastructure. Learn how it works and when to use it.

Read more

Subscribe to our PRO plan.

Looking to monitor your website and domains? Join our platform and start today.