Much That We Have Gotten Wrong About SRE

An illustrated summary of Developers ➡ DevOps ➡ SRE

Contents

1. Developers wanted to ship their produce

To the other side

2. Production never matches the development environment. It resembles, but cannot match

So they deployed people on the other side

3. But this process was slow, they wanted to deploy faster

So they deployed Continuous-Deployment (CI/CD)

4. To improve reliability, we got SRE to do this

SREs’ first job was to hold this ship, but that’s all where they got stuck at

5. What Site Reliability Engineers should’ve built is

SREs should’ve been *engineering* and *observing* the bridge, but instead they became the bridge

About the authors
Piyush Verma

Piyush Verma

Co-Founder at Last9

Start observing for free. No lock-in.

OpenTelemetry · Prometheus

Just update your config. Start seeing data on Last9 in seconds.

Datadog · New Relic · Others

We've got you covered. Bring over your dashboards & alerts in one click.

Built on Open Standards

100+ integrations. OTel native, works with your existing stack.