Question 1

Fly.io has machine health checks — why use SitePulse too?

Accepted Answer

Fly's health checks decide whether to route traffic to a machine, so they run at the infrastructure layer. SitePulse runs at the application layer, probing your public URL the same way a real user does. A machine can pass Fly's TCP health check and still return 500 on every HTTP request because of a broken env var, a crashed dependency, or a bad database migration. SitePulse catches application-level failures that a TCP-level check can't see.
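To make the difference concrete, here is a minimal sketch (not SitePulse's or Fly's actual implementation) that starts a deliberately broken local app returning 500 on every request, then runs a TCP-level check and an HTTP-level check against it. The names BrokenApp, tcp_check, http_check and demo are hypothetical and exist only for this illustration.

```python
import socket
import threading
import urllib.error
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Tuple


class BrokenApp(BaseHTTPRequestHandler):
    """Stand-in for an app that accepts connections but fails every request."""

    def do_GET(self):
        self.send_response(500)
        self.end_headers()

    def log_message(self, *args):
        pass  # keep the demo quiet


def tcp_check(host: str, port: int, timeout: float = 2.0) -> bool:
    """Infrastructure-style check: can a TCP connection be opened?"""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def http_check(url: str, timeout: float = 2.0) -> bool:
    """Application-style check: does a real GET come back with a 2xx/3xx?"""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (urllib.error.URLError, OSError):
        # urlopen raises HTTPError (a URLError subclass) for 4xx/5xx responses.
        return False


def demo() -> Tuple[bool, bool]:
    """Run both checks against the broken app; returns (tcp_ok, http_ok)."""
    server = HTTPServer(("127.0.0.1", 0), BrokenApp)  # port 0 = any free port
    port = server.server_address[1]
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        return tcp_check("127.0.0.1", port), http_check(f"http://127.0.0.1:{port}/")
    finally:
        server.shutdown()
        server.server_close()
```

Calling demo() shows the TCP check passing while the HTTP check fails, which is the gap an application-layer probe is meant to cover.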

Question 2

My Fly app is deployed across multiple regions — how does monitoring work?

Accepted Answer

SitePulse probes from a single location (Tokyo) and hits your app's anycast address. Fly routes the probe to the healthy region nearest to that location. This tests your global routing and the health of the region that answers the probe. If a specific region is degraded but others are healthy, Fly usually routes around it automatically, so the probe alone won't tell you which region is down. For region-specific monitoring, the most practical approach is to add monitors for region-specific URLs if your app exposes them.

Question 3

How do I monitor Fly Postgres?

Accepted Answer

Don't expose your Fly Postgres instance publicly. Instead, add a /api/health route to your app that does a lightweight database query and returns 200 or 503. Point SitePulse at that endpoint. When Fly Postgres degrades, your health check route will return 503, and SitePulse will email you within a minute.
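As a rough sketch of what such a route's logic could look like, the function below runs a trivial query and maps the outcome to 200 or 503. It is framework-agnostic and assumes a DB-API style connection; health_check and the connect argument are hypothetical names, and in a real app connect would open a connection to your Fly Postgres database with whichever driver you use.

```python
from typing import Any, Callable, Dict, Tuple


def health_check(connect: Callable[[], Any]) -> Tuple[int, Dict[str, str]]:
    """Return (status_code, json_body) for a /api/health route.

    `connect` is any zero-argument callable that returns a DB-API connection.
    """
    try:
        conn = connect()
        try:
            cur = conn.cursor()
            cur.execute("SELECT 1")  # lightweight query: no tables touched
            cur.fetchone()
        finally:
            conn.close()
    except Exception as exc:
        # Any failure (connect error, timeout, query error) means "unhealthy".
        return 503, {"status": "unavailable", "error": type(exc).__name__}
    return 200, {"status": "ok"}
```

Your web framework's handler would call health_check and return the status code and body as JSON. Reporting only the exception type, not its message, avoids leaking connection details on a public endpoint.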

Question 4

What about Fly Machines that scale to zero?

Accepted Answer

If you're using Fly Machines with scale-to-zero, the first request after idle wakes the machine — this can take 2–10 seconds. SitePulse's probe will see a slow response or timeout depending on your timeout setting. To prevent scale-to-zero from affecting real users, set a SitePulse monitor with a 5-minute interval to keep at least one machine warm.
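The interaction between cold starts and the probe timeout can be sketched as a small classifier. This is only an illustration: classify_probe is a hypothetical helper, and the 2-second "slow" threshold is an assumption taken from the low end of the wake-up range above, not a SitePulse setting.

```python
from typing import Optional


def classify_probe(
    elapsed_s: Optional[float],
    timeout_s: float,
    slow_after_s: float = 2.0,  # assumed threshold, not a SitePulse default
) -> str:
    """Classify one probe result as "ok", "slow", or "timeout".

    `elapsed_s` is the response time in seconds, or None if nothing came back.
    """
    if elapsed_s is None or elapsed_s >= timeout_s:
        return "timeout"
    if elapsed_s >= slow_after_s:
        return "slow"
    return "ok"
```

For example, a 6-second cold start is classified as "slow" with a 10-second timeout but as "timeout" with a 5-second one, so the timeout you choose decides whether a wake-up looks like an outage.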

Question 5

Can I monitor internal Fly services?

Accepted Answer

Only if they have a public endpoint. For internal services accessible via Fly's private network (6PN), you can't directly probe them from outside. The best option is to add a health-check route to your public-facing app that proxies a check to your internal service and returns the result.
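One way to sketch such a proxying route, using only the Python standard library: the public app calls the internal service's own health endpoint over the private network and translates the result into 200 or 503. The URL below is a made-up example of an internal address, and fetch_status and proxied_health are hypothetical names for this illustration.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict, Tuple

# Hypothetical internal address; substitute your own service's host and port.
INTERNAL_HEALTH_URL = "http://my-worker.internal:8080/health"


def fetch_status(url: str, timeout: float) -> int:
    """GET `url` and return the HTTP status code, including 4xx/5xx codes."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        return exc.code  # the service answered, but with an error status


def proxied_health(
    fetch: Callable[[str, float], int] = fetch_status,
    url: str = INTERNAL_HEALTH_URL,
    timeout: float = 2.0,
) -> Tuple[int, Dict[str, object]]:
    """Return (status_code, json_body) for a public route that checks an internal service."""
    try:
        upstream = fetch(url, timeout)
    except Exception as exc:
        # DNS failure, connection refused, or timeout on the private network.
        return 503, {"internal": "unreachable", "error": type(exc).__name__}
    if 200 <= upstream < 300:
        return 200, {"internal": "ok"}
    return 503, {"internal": "unhealthy", "upstream_status": upstream}
```

Keeping the upstream timeout short (2 seconds here, an arbitrary choice) helps the public route answer before the external probe's own timeout expires.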

Uptime monitoring for Fly.io apps

What Fly can't see

What to monitor on a Fly.io app

Production URL

/api/health endpoint

Scale-to-zero keep-alive

Background workers

Fly Postgres health

Public status page

Set it up in 60 seconds

What you get

1-minute checks

SSL expiry alerts

Public status page

Add application-layer monitoring to your Fly app

Frequently asked questions