
A case for Observability outside engineering teams
Observability is being built by engineers for engineers. In reality, o11y is for all.


Understanding the Rasmussen model for failures
What does the Rasmussen model teach us about Site Reliability Engineering?


How we tame High Cardinality by Sharding a stream
Using 'Sharding' to tame High Cardinality data for Levitate - Our Time Series Data Warehouse

Thanos vs. VictoriaMetrics
A deep dive comparison between Thanos and VictoriaMetrics: Performance and Differences


1979, a nuclear accident and SRE
Deep diving into the 'Normal accident' theory by Charles Perrow, and what it means for SREs


Ingest OpenTelemetry metrics with Prometheus natively
Native support for OpenTelemetry metrics in Prometheus


How we tame high cardinality in time series databases
Engineering innovation to solve high cardinality with Levitate - a multi-part series


InfluxDB vs. Thanos
InfluxDB vs Thanos: Overview, Pros and Cons, and Differences


What Site Reliability Engineering Needs: A Swarm of Bees
If all companies are software companies, all companies need better Observability to understand how performative their software is
