Alerts for self-hosted customer deployments

Alerts for self-hosted customer deployments

# devops# opensource# containers# automation
Alerts for self-hosted customer deploymentsLouis Weston

TL;DR: Distr is an Open Source platform to distribute applications to on-prem, BYOC, and air-gapped...

TL;DR: Distr is an Open Source platform to distribute applications to on-prem, BYOC, and air-gapped customers. It now supports alerts so you get notified about failed updates, crash loops, or unreachable deployments before your customers do.

The Visibility Gap

For your own application, you’ve got this figured out. Grafana alerts, PagerDuty, whatever. Something breaks in production, you know about it in minutes.

For self-hosted customer deployments? Nothing. Radio silence unless they open a ticket.

You have all this sophisticated monitoring for your own infrastructure, but the moment your software crosses over to customer-controlled environments, you’re flying blind. You find out things are broken when your customer tells you they’re broken. Sometimes days later.

You can’t SSH into every customer deployment and tail logs. That doesn’t scale, and frankly, your customers probably wouldn’t let you even if you wanted to.

What We Built

Alerts in Distr work for both sides:

Distr lets you configure alerts in the Vendor portal. Set them once, they apply everywhere. Or get granular if you want.

For customers that manage and update the deployments themselves, you can enable alerts for them as well so they can configure their own alerts in the Customer portal.

This dual approach matters because not all customer relationships look the same. Some of your customers want you monitoring their deployments—they’d rather you catch issues first and fix them proactively. Others prefer to run their own ops and want full control over who gets notified when.

For example, you’ll get alerted if an update fails or gets stuck looping, if we lose contact with a deployment (network issues, dead instance, whatever), or if your app’s in a crash loop. We filter out normal restarts from successful updates though—no point spamming you about routine operations.

The details on setting this up are in the alerts documentation.

How It Helps

The main thing is you catch issues before your customer does. That’s worth a lot.

We had one customer who didn’t realize their deployments were in a crash loop for two days because the app was restarting fast enough that it looked “up” when they checked. With alerts, that would’ve been caught in the first 10 minutes.

Getting Started

If you’re already using Distr, alerts are available now. Vendor settings for your side, customer portal for theirs. Takes about 30 seconds to configure.

If you’re not using Distr yet and you’re dealing with software distribution to on-prem or customer-controlled infrastructure… well, we should probably talk.

Let us know what you think, we would love to hear from you or support us by leaving a star:

⭐ distr-sh/distr ⭐