Website monitoring checklist showing all required monitoring checks for production web applications
# guides

Website Monitoring Checklist for Developers

Before you launch a production application — and periodically thereafter — run through this monitoring checklist. It covers every layer of monitoring a production web application needs, from basic uptime checks to SSL certificates and background job monitoring.

Before Launch: Minimum Viable Monitoring

At bare minimum, set these up before any production launch:

  • HTTP uptime monitor on your main domain — 1-minute check interval
  • SSL certificate monitor — alerts at 30 days before expiry
  • Domain expiry monitor — alerts at 60 days before expiry
  • SMS alert configured for downtime (not just email)
  • Recovery alert enabled so you know when issues resolve

This takes about 10 minutes to configure and protects you from the most common failure modes. See how to set up uptime monitoring.


Uptime Monitoring

HTTP Checks

  • Main domain monitored (https://yourdomain.com)
  • Critical application endpoints monitored (not just homepage)
  • API health endpoint monitored (/health or /api/health)
  • Content verification enabled (not just status code check)
  • Check interval set appropriately (1 min for production, 5 min for staging)
  • Multi-location checks enabled (at least 3 locations)
  • Confirmation count set to 2 (avoids false alarms from transient blips)

For APIs

  • Health endpoint returns 200 with useful JSON ({"status": "ok"})
  • Authentication endpoints monitored
  • Critical API routes monitored
  • Response time threshold configured

SSL Certificate Monitoring

  • SSL certificate monitored for each domain (including subdomains with separate certs)
  • Alert at 30 days remaining (email/Slack)
  • Alert at 14 days remaining (escalated)
  • Alert at 7 days remaining (urgent)
  • Auto-renewal verified working (Let's Encrypt, cert-manager, etc.)
  • Wildcard certificates accounted for

See the full guide: SSL certificate monitoring


Domain Registration Monitoring

  • Domain expiry monitored for all registered domains
  • Alert at 60 days remaining
  • Alert at 30 days remaining
  • Alert at 14 days remaining
  • Auto-renewal enabled at registrar
  • Domain lock (clientTransferProhibited) enabled
  • Registrant contact information is current

See the full guide: domain expiry monitoring


Alert Configuration

  • SMS alerts for P1/P2 downtime
  • Slack alerts for team visibility
  • Email alerts for SSL/domain warnings
  • Recovery alerts enabled
  • Alert routing verified (correct people receiving alerts)
  • Alerts tested (deliberately triggered and confirmed received)
  • Maintenance windows configured for scheduled deployments
  • On-call rotation documented

See the full guide: how to set up downtime alerts


Background Jobs and Scheduled Tasks

For any cron jobs or background workers:

  • Heartbeat monitor configured for each critical cron job
  • Heartbeat interval matches cron schedule
  • Grace period accounts for expected execution time
  • Ping URL added to job script (on successful completion)
  • Workers with HTTP ports have external HTTP monitors

See the full guide: how to monitor cron jobs


Application Health Endpoints

Verify your application has proper health endpoints:

  • /health endpoint exists and returns 200
  • Liveness check: process alive (fast, lightweight)
  • Readiness check: dependencies available (database, cache, external APIs)
  • Health endpoints accessible to monitoring service
  • Health endpoints not accessible publicly (or rate limited)

For Kubernetes: monitoring Kubernetes pods


Platform-Specific Checks

For hosted platforms:


Incident Response Readiness

  • Incident response runbook documented and accessible
  • On-call rotation defined
  • Public status page set up
  • Status page subscriber notifications configured
  • Escalation procedure documented
  • Post-mortem process defined

See related guides:


Periodic Review (Quarterly)

  • Verify all monitors are still pointing at correct URLs
  • Test alert delivery (deliberately trigger a monitor failure)
  • Review false positive rate — adjust confirmation counts if needed
  • Update on-call contact information
  • Review incident history — any recurring patterns?
  • Check new services added since last review
  • Confirm monitoring coverage matches current architecture

Documentation

  • Monitoring setup documented in team wiki
  • Monitor list and their purpose documented
  • Alert routing documented
  • Maintenance window procedure documented

This checklist covers the monitoring needs of most production web applications. Bookmark it, run through it at launch, and revisit quarterly.


Check off this list using Domain Monitor — uptime, SSL, domain, and heartbeat monitoring in one place.

More posts

What Is Generative AI? How It Works and What It Creates

Generative AI creates new content — text, images, code, and more. This guide explains how it works, what tools are available, and where it's genuinely useful versus overhyped.

Read more
What Is Cursor AI? The AI Code Editor Explained

Cursor AI is an AI-powered code editor built on VS Code. Learn what it does, how it works, and whether it's the right tool for your development workflow.

Read more
What Is Claude Opus? Anthropic's Most Powerful Model Explained

Claude Opus is Anthropic's most capable AI model, built for complex reasoning and demanding tasks. Learn what it does, how it compares, and when to use it.

Read more

Subscribe to our PRO plan.

Looking to monitor your website and domains? Join our platform and start today.