Product

Discover
Auto-discover everything you run & trace problems to the root cause—fast

Services Kubernetes Jobs Hosts

Explore

Stream & analyze millions of events per minute, all correlated

Logs Traces Metrics

Control Plane
First-class DX to manage incoming telemetry data in real-time

Ingestion Storage Query Analytics

AI
Natural language insights & debugging in your IDE

RUM
Monitoring with business context

Alerting
For high-cardinality environments
Resources

Guides
Comprehensive docs for engineers building large-scale applications

OpenTelemetry High Cardinality Prometheus LogQL

Blog
Stories, guides, and lessons from the world of observability

Events
SRE & DevOps sharing meets

Changelog
Updates and improvements
Customers
Docs
Pricing
Book demo

Blog illustration

Blog

Stories, guides, and lessons from the world of observability

CtrlK

The importance of structured communication in the world of SRE

The importance of structured communication in the world of SRE

How you communicate helps build your 9s. In the world of Site Reliability Engineering, this is crucial. How do you do it?

Saurabh Hirani

Best Practices Using and Writing Prometheus Exporters

Best Practices Using and Writing Prometheus Exporters

This article will go over what Prometheus exporters are, how to properly find and utilize prebuilt exporters, and tips, examples, and considerations when building your own exporters.

Last9

The difference between DevOps, SRE, and Platform Engineering

The difference between DevOps, SRE, and Platform Engineering

In reliability engineering, three concepts keep getting talked about - DevOps, SRE and Platform Engineering. How do they differ?

Prathamesh Sonpatki

Thanos v/s Cortex

Thanos vs Cortex

In-depth comparison of Cortex and Thanos, what specifically they help teams do, challenges in implementing both, and how to think about what’s right for your team.

Sahil Khan

Introduction to DORA Metrics

Introduction to DORA Metrics

DORA metrics, what they are, why they are important, and best practices for measuring them.

Prathamesh Sonpatki

Golang's Stringer tool

Golang's Stringer tool

Learn about how to use, extend and auto-generate Stringer tool of Golang

Arjun Mahishi

How to improve Prometheus remote write performance at scale

How to improve Prometheus remote write performance at scale

Deep dive into how to improve the performance of Prometheus Remote Write at Scale based on real-life experiences

Saurabh Hirani

Prometheus vs InfluxDB

Prometheus vs InfluxDB: Side-by-Side Comparison

What are the differences between Prometheus and InfluxDB - use cases, challenges, advantages and how you should go about choosing the right tsdb

Anjali Udasi

India vs Pakistan: SRE and the Shannon Limit

India vs Pakistan: SRE and the Shannon Limit

How does one ‘detect change’ in a complex infrastructure, so you don’t lose out on critical revenues — A short SRE story

Satyajeet Jadhav