Blog | Last9

What does "Cricket scale" mean for a Site Reliability Engineer?

Understanding “Cricket Scale”

How does a DevOps/Site Reliability Engineer plan for "Cricket scale"? How do you warm up systems that are about to witness 30+ million concurrent users?

Aniket Rao

Mar 23, 2023

What is MTBI?

Everything you need to know about Mean Time Between Incidents (MTBI) and how it can help Site Reliability Engineers.

Last9

Mar 20, 2023

Reliability Engineering for Dummies: ELI5

Explaining Reliability Engineering to a 5-year-old.

Mohan Dutt Parashar

Mar 9, 2023

SLA vs SLO vs SLI - What's the difference

SLAs, SLOs, and SLIs—what’s the difference? For DevOps folks, understanding these nuances is key. Here's a quick guide to each term.

Last9

Mar 7, 2023

Do your alerting tools improve outcomes for Business?

Rethinking Anomaly Detection: Focus on business outcomes

From the trenches at Games24x7: Sanjay on how reliability engineering should drive core business metrics.

Sanjay Singh

Feb 16, 2023

Interesting talks on Observability from FOSDEM 2023

A recap of the talks from the Observability and Monitoring devroom at FOSDEM 2023.

Prathamesh Sonpatki

Feb 14, 2023

Comparing Popular Service Mesh Offerings

An in-depth look at several service mesh offerings, compared on features, licensing and pricing, architecture, and user experience.

Last9

Feb 3, 2023

Prometheus Monitoring

Prometheus is a popular open-source monitoring system. In this blog, we'll cover the basics of Prometheus monitoring, including its architecture, key features, and alternatives.

Last9

Jan 31, 2023

A good chunk of SRE woes can be traced back to the stronghold of tribal knowledge across teams 😵‍💫

Observability is dead, long live observability

No tool can magically offer you 99.999s. Observability is largely about the basics. And basics are boring. But boring is hard. Boring is battle-tested.

Aniket Rao

Jan 19, 2023