Blog | Last9

Battling Alert Fatigue

What alert fatigue is, and techniques to reduce it

Last9

Guide to Service Level Indicators and Setting Service Level Objectives

SLOs, SLIs, and SLAs: Understanding Key Service Metrics

A guide to setting practical Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for your Site Reliability Engineering practice.

Last9

Kubernetes Monitoring with Prometheus and Grafana

A guide to implementing Prometheus and Grafana in your Kubernetes cluster.

Last9

Why We Auto-Delete Slack Messages at Last9

At Last9, we auto-delete Slack DMs after 2 days. This pushes teams to improve documentation, reduce tribal knowledge, and own accountability.

Nishant Modak

Static Threshold vs. Dynamic Threshold Alerting

What's the difference between static threshold and dynamic threshold alerting? Do you really know when and how to use each threshold type?

Last9

Why MTTR should be a ‘business’ metric

A key challenge is aligning engineering health metrics with business goals. How can the business measure engineering, and how can engineering show its value?

Sidu Ponnappa

Observability - That Last 9

TL;DR: A stitch in time saves 9. A discussion of the key building blocks of observability.

Akash Saxena

How we won Dukaan over

5 meetings. 1 month. Subhash and his team’s velocity in decision-making, speed of movement, and radical candor are a breath of fresh air in the Indian startup ecosystem.

Aniket Rao

Sample vs Metrics vs Cardinality

When dealing with time series databases, I was always confused by samples vs. metrics vs. cardinality. Here’s an explanation as I have understood it.

Piyush Verma