Monitoring check frequency comparison chart showing tradeoffs between 30 second, 1 minute and 5 minute intervals
# website monitoring

How to Choose Your Uptime Monitoring Check Frequency

One of the first decisions when setting up uptime monitoring is how often your monitors should run. Check too infrequently and you'll discover outages late. Check too frequently and you're paying for granularity you don't need.

The right check frequency depends on your application's criticality, your SLA requirements, and how quickly you can respond to an alert.

What Check Frequency Actually Determines

Check frequency sets the worst-case detection time — the maximum amount of time between when a failure starts and when your monitor first detects it.

  • 30-second checks: worst case 30 seconds to detection
  • 1-minute checks: worst case 60 seconds to detection
  • 5-minute checks: worst case 5 minutes to detection

Note that detection time and alert time are different. If you require 2 consecutive failures before alerting (which is best practice), add one additional check interval to the detection time.

Check IntervalMax Detection TimeWith 2-Failure Confirmation
30 seconds30 seconds~60 seconds
1 minute60 seconds~2 minutes
5 minutes5 minutes~10 minutes
15 minutes15 minutes~30 minutes

Choosing by Application Type

High-Traffic Production Applications

Recommended: 30-second or 1-minute checks

For applications serving real users and generating revenue, fast detection is worth the investment. Every minute of undetected downtime is lost revenue and damaged user trust.

30-second checks make sense when:

  • You have a formal SLA requiring fast incident response
  • Your application handles payments or real-time transactions
  • Your MTTR target requires detection under 1 minute
  • Your mean time to recovery is a tracked business metric

1-minute checks are appropriate for:

  • Most standard production web applications
  • SaaS products with active users
  • E-commerce stores

APIs and Backend Services

Recommended: 1-minute checks

APIs often power mobile apps and integrations where failures cause cascading problems. 1-minute checks give you fast detection without over-checking.

For monitoring AI API endpoints where downstream applications depend on availability, 1-minute checks with SMS alerts are appropriate.

Internal Tools and Admin Panels

Recommended: 5-minute checks

Internal tools typically have lower urgency — a 5-minute delay in discovering a problem is acceptable. These also tend to have fewer users and lower business impact.

Staging and Development Environments

Recommended: 5-15 minute checks

You want to know if staging goes down (especially before a demo or launch), but you don't need the same urgency as production. 5-minute checks with email-only alerts are appropriate.

Heartbeat Monitors (Cron Jobs)

Interval-dependent: For heartbeat monitoring, the check frequency should match the job's schedule. A cron job that runs every 5 minutes should send a heartbeat every 5 minutes; the monitor should alert if no heartbeat arrives within 7-10 minutes.

The False Positive Tradeoff

Higher check frequency means more checks — and more opportunities for transient network failures between the monitoring server and yours to generate false positives.

A transient failure that lasts 45 seconds:

  • With 30-second checks, 2-failure confirmation: could generate an alert (two failures in a row)
  • With 1-minute checks, 2-failure confirmation: no alert (the second check runs after recovery)

This is why confirmation counts matter. If you use very frequent checks (30 seconds), increase your confirmation count to 3 to maintain a reasonable signal-to-noise ratio.

SLA Requirements and Check Frequency

If you're managing a service level agreement that specifies uptime targets, check frequency affects how accurately you can measure compliance.

With 1-minute checks, you might not detect a 45-second outage at all. With 30-second checks, you capture more incidents. For SLA reporting purposes, higher-frequency checks give more accurate uptime percentages.

A common setup for SLA-governed applications:

  • 30-second checks for real-time detection
  • Monitored from multiple locations for accuracy
  • Reports exported monthly for SLA compliance review

Multi-Location and Check Frequency

If you're using multi-location monitoring, each location runs at the configured interval. This means your effective check frequency from any individual location is the configured interval — but you're getting more data points overall.

For a 1-minute interval with 5 monitoring locations, you receive 5 checks per minute. This provides faster anomaly detection and better confidence that alerts represent real issues.

Use CaseCheck IntervalConfirmationAlert Channels
E-commerce (revenue critical)30 seconds2 failuresSMS + Slack
SaaS application1 minute2 failuresSMS + Slack
API service1 minute2 failuresSMS + Slack
Marketing site5 minutes2 failuresEmail + Slack
Internal tool5 minutes2 failuresEmail
Staging environment15 minutes1 failureEmail
Cron job heartbeatMatch scheduleTimeout windowEmail + Slack

Changing Check Frequency Over Time

Start with 1-minute checks for any new production service. Review after a month:

  • High false positive rate? Increase confirmation count or reduce frequency
  • Missing incidents or detecting them late? Increase frequency
  • High traffic growth? Consider reducing to 30 seconds

Check frequency isn't permanent — adjust it as your application grows and your requirements change.


Configure your monitoring check frequency at Domain Monitor — choose from 30-second to 15-minute intervals with multi-location support.

More posts

What Is Generative AI? How It Works and What It Creates

Generative AI creates new content — text, images, code, and more. This guide explains how it works, what tools are available, and where it's genuinely useful versus overhyped.

Read more
What Is Cursor AI? The AI Code Editor Explained

Cursor AI is an AI-powered code editor built on VS Code. Learn what it does, how it works, and whether it's the right tool for your development workflow.

Read more
What Is Claude Opus? Anthropic's Most Powerful Model Explained

Claude Opus is Anthropic's most capable AI model, built for complex reasoning and demanding tasks. Learn what it does, how it compares, and when to use it.

Read more

Subscribe to our PRO plan.

Looking to monitor your website and domains? Join our platform and start today.