"Escalation" sounds like an enterprise word. It conjures up on-call rotations, incident commanders, PagerDuty dashboards, a room full of ops engineers. So solo developers and small teams tend to assume it isn't for them — that escalation is something you adopt later, once you're big enough to have a "team" worth the name.
That's backwards. A solo developer is the person who needs escalation most, because there is no one else. On a big team, if one engineer sleeps through an alert, another might catch it. When you're on your own, a missed alert is just a missed alert — the outage runs until you happen to notice. Escalation is the mechanism that substitutes for the second person you don't have.
This guide is about building that mechanism at a realistic scale: what escalation actually is, why it matters more when you're small, and how to set up an alert chain that reliably reaches a human — without buying software designed for a 200-person engineering org.
An alert is a single event: one email, one SMS, one Slack message. It fires once and it's done. Whether a human actually saw it is not the alert's problem.
Escalation is a sequence with a feedback condition. It works like this: fire the first alert, then wait. If nobody acknowledges within a set time, fire a second alert — louder, different channel. Wait again. Still nothing? Fire the third. Keep going until a human explicitly acknowledges — clicks a link, replies to the SMS, presses a key on the call — at which point the chain stops.
The key word is acknowledges. A plain alert is satisfied when it has been sent. An escalation chain is satisfied only when a human has responded. That difference — sent versus responded — is the entire reason escalation exists, and it's why a single alert, however well configured, has a hole in it.
There are a few reasons escalation matters more for a one-person or small operation:
None of this requires a big-company solution. It requires a correctly shaped small one.
You don't need a complex routing tree. For a solo developer, a good chain has three or four steps, ordered from quiet-and-cheap to loud-and-certain.
Step 4 is the honest answer to "but I'm only one person." You don't need a rotation. You need exactly one fallback human, so the chain doesn't dead-end at you.
Once two or three people share responsibility, the change is to add a real on-call rotation: the chain routes to whoever is on duty this week, not always the same name. The principle is identical — quiet to loud, acknowledge to stop — but the "who" rotates so no single person carries every night. The handful of steps still look the same: notify on-call, escalate louder, voice call, then fall back to a second team member if the on-call person is unreachable.
| Step | Solo developer | Small team |
|---|---|---|
| 1 | Email + chat to you | Email + chat to on-call person |
| 2 | SMS to you | SMS to on-call person |
| 3 | Voice call to you | Voice call to on-call person |
| 4 | Fallback: one trusted contact | Fallback: second team member / lead |
When a small team decides to "take alerting seriously," the reflex is to reach for PagerDuty or Opsgenie. These are excellent products — but they are incident-management platforms built for organisations with many engineers, complex routing, and dedicated ops practices. For a solo developer or a team of three, adopting one usually means paying for — and configuring, and maintaining — far more capability than you'll use.
There's also a structural cost: PagerDuty doesn't monitor anything. It's the escalation layer only. So you end up running two products — a monitoring tool to detect the outage, and PagerDuty to escalate it — wired together, two bills, two configurations.
For most small operations, the simpler and cheaper answer is a monitoring tool that has escalation built in: one product that both detects the problem and runs the alert chain. You configure the monitors and the escalation in the same place, and there's nothing to integrate. You can always graduate to a dedicated incident-management platform later, when team size and complexity actually justify it. Most small teams never need to.
Escalation is not an enterprise luxury — it's most needed by the people with the fewest humans to catch a missed alert. A solo developer or small team doesn't need PagerDuty; they need a correctly shaped alert chain: a few steps ordered quiet to loud, each waiting for acknowledgment before the next fires, and — crucially — a final fallback to someone other than yourself. Build that, test it by ignoring a real alert on purpose, and an outage can no longer run silently just because one phone was off.
failover.io: uptime monitoring with built-in alert chains that climb until someone acknowledges. Free plan, no credit card.
Start monitoring free →