Nov 18th, 2020

Much That We Have Gotten Wrong About SRE

An illustrated summary of Developers ➡ DevOps ➡ SRE

1. Developers wanted to ship their product

To the other side

2. Production never matches the development environment. It can resemble it, but never match it

So they deployed people on the other side

3. But this process was slow, and they wanted to deploy faster

So they deployed Continuous Integration and Continuous Deployment (CI/CD)

4. To improve reliability, we got SRE to do this

SREs’ first job was to hold this ship, but that is where they got stuck

5. What Site Reliability Engineers should’ve built is

SREs should’ve been *engineering* and *observing* the bridge, but instead they became the bridge

About the authors
Piyush Verma

Co-Founder at Last9
