-
-
Notifications
You must be signed in to change notification settings - Fork 6.9k
Description
⚠️ Please verify that your question has not already been reported
- I have searched the existing issues and found no similar reports.
🛡️ Security Policy
- I have read and agree to Uptime Kuma's Security Policy.
📝 Describe your problem
Hi Uptime Kuma, I'd like to ask for help regarding the following issue I've had for the past two weeks with the tool.
Context:
We host most of our external, client-facing services as well as internal services on Google Cloud's Compute Engine virtual machine (VM) instances. For those services that have a static external IP, we
- Create a DNS record for the IP in Cloudflare
- Set up a reverse proxy for the DNS record's corresponding hostname using NGINX
- Protect the hostname's corresponding URL behind Cloudflare Zero Trust
- Make sure the service is up and running by adding a Uptime Kuma monitor for the service, with the following configuration:
- Monitor Type: HTTP(s)
- URL: https://<server-hostname-specified-in-nginx-configuration>/
- Heartbeat Interval (Check every 60 seconds): 60
- Retries: 2
- Heartbeat Retry Interval (Retry every 60 seconds): 60
- Request Timeout (Timeout after 48 seconds): 48
- Resend Notification if Down X times consecutively (Resend disabled): 0
- Max. Redirects: 10
- Accepted Status Codes: 200-299
- Headers:
{ "CF-Access-Client-Id": "<cloudflare-access-client-id>", "CF-Access-Client-Secret": "<cloudflare-access-client-secret>" }
We've had this same set up all year, and it worked great until earlier this month.
Issue:
The issue was first detected on Wednesday, Oct 15 at 8:52pm MT. We found out that one of our services was offline, which surprised us because the service's Uptime Kuma monitor didn't send us an alert; instead, the monitor kept saying status is "up" and message is "200 - OK". When we brought the service back online, the monitor also didn't send us a notification.
It's been almost 10 days since the issue was first detected. At this point, we are aware that at least half of the monitors aren't working (although we've confirmed that some of the monitors are working, sending us alerts when the correspond services went down). For those services, I've had to manually click on their links every two hours to ensure that they're online, which defeats the purpose of Uptime Kuma.
Our attempts to resolve the issue so far:
- Restarted Uptime Kuma's Docker container
- Restarted the Google Cloud Compute Engine VM instance where Uptime Kuma's Docker container is hosted
- Migrated Uptime Kuma from v1 to v2
Unfortunately, none of those attempts resolved the issue. And we continue to have some Uptime Kuma monitors working and some not working.
📝 Error Message(s) or Log
N/A🐻 Uptime-Kuma Version
2.0.2
💻 Operating System and Arch
Debian GNU/Linux 12 (bookworm) x86_64
🌐 Browser
Google Chrome 141.0.7390.108 (Official Build) (arm64)
🖥️ Deployment Environment
- Runtime Environment:
- Docker: Version
28.3.2(Build578ccf6) - Docker Compose: Version
v2.38.2
- Docker: Version
- Database: N/A
- Database Storage:
- Filesystem:
- Linux: ext4
- Storage Medium: SSD
- Filesystem:
- Uptime Kuma Setup:
- Number of monitors:
22
- Number of monitors: