Blog

Stories, guides, and lessons from the world of observability

OSS vs Paid vs Managed OSS — Picking what works for your Observability journey

Observability—OSS vs Paid vs Managed OSS

The reliability industry needs a managed answer, free of vendor lock-in, to spiraling costs, high cardinality, and the toil of managing a TSDB.

Satyajeet Jadhav

Learnings integrating jmxtrans with Levitate

Learnings integrating jmxtrans

JMX metrics give solid insights into the workings of your application. Integrating them with Last9 (our time series data warehouse) required us to jump through some hoops with vmagent.

Saurabh Hirani

MTTF vs MTBF vs MTTD vs MTTR

This article explains what MTTF, MTBF, MTTD, and MTTR are, how they differ, how to adopt them, and where each is useful.

Last9

The neglected tech arctic winter — Internal SaaS expenses

The current tech winter reveals a hard truth: spending on internal tools for tech infrastructure is bloated—and this isn't just a passing cycle.

Nishant Modak

Recap of SRECon Americas 2023

SRECon is a conference hosted by USENIX that focuses on site reliability, distributed systems, and systems engineering at scale. This post recaps SRECon Americas 2023.

Last9

What does "Cricket scale" mean for a Site Reliability Engineer?

Understanding “Cricket Scale”

How does a DevOps/Site Reliability Engineer plan for "Cricket scale"? How do you warm up systems that are about to witness 30+ million concurrent users?

Aniket Rao

What is MTBI?

Everything you need to know about Mean Time Between Incidents (MTBI) and how it can help Site Reliability Engineers.

Last9

Reliability Engineering for Dummies: ELI5

Explaining Reliability Engineering to a 5-year-old.

Mohan Dutt Parashar

Difference between SLA and SLO

SLA vs SLO vs SLI - What's the difference?

SLAs, SLOs, and SLIs—what’s the difference? For DevOps folks, understanding these nuances is key. Here's a quick guide to each term.

Last9