Skip to content

Our backup job runs but we miss an alert when it fails, how do we guarantee we'll know?

Classic 'backup failing for 3 months unnoticed'. Fix isn't one more email, it's alerting with a dead-man's switch: alarm on failure AND alarm on silence.

Try this first

  1. 1Configure success and failure mails per backup job. Send to a shared mailbox (backup@company.com) or ticketing tool, not one person. Person sick or gone = blackout.
  2. 2More important: set up a dead-man's switch. Healthchecks.io, Cronitor or a custom Grafana dashboard expects a ping per successful run. No ping = alert.
  3. 3For cloud backup tools with APIs: scrape status yourself and log to your monitoring. Veeam, NAKIVO, Synology have REST APIs. Without your own monitoring you trust only the vendor.
  4. 4Multiple channels: email, Slack or Teams, SMS for genuinely critical jobs. A mail in spam isn't an alert.
  5. 5Test alerting: deliberately disable a job and check whether the alert fires. No alert in 24h = your alerting doesn't work as you thought.
  6. 6Schedule a weekly review: someone looks at the dashboard Friday morning, separate from automated alerts. That catches subtle issues (job suddenly runs faster, retention filling up) that alerts miss.

When to bring us in

With multiple backup tools at once, monitoring becomes a puzzle. A central tool like Veeam ONE, NinjaRMM or an MSP that runs it can take the overhead. Ask for a review.

See also

None of the above fits?

Describe your situation below. We pass your input plus the steps you already saw to our AI and return tailored next-step advice. If it's too risky to DIY, we'll say so.

Who are you?

For the AI question we need your email and company, so we can follow up if the AI gets stuck, and to prevent abuse.

Limited to 2 questions per hour and 5 per day, kept lean so the AI stays useful. For more, contacting us directly works better for you and us.

Or skip the DIY entirely

Our Managed IT clients do not look these things up. One point of contact, a fixed monthly price, resolved within working hours.