Jan 17th, ‘23/1 min read

When should I start thinking of observability?

How does one scale metrics maturity in a cloud-native world — A guide on observability tooling as your engineering org scales.

When should I start thinking of observability?

When you’re a 5-person engineering team, observability tooling is the least of your concerns. But putting it on the back burner only means prolonging the inevitable. It’s a question I get asked often,

“At which point in the maturity of an engineering org should you focus on reliability tooling?”.

“We’re flying blind for now, but diagnosing incidents is becoming harder — should I start considering tooling right away?”

I thought I’d write a whitepaper to understand how engineering organizations should build reliability and what factors to consider. And here we are.

As an engineering organization scales, your metrics grow across dimensions:

  1. Retention
  2. Unique metric types
  3. Cardinality, or instances

However, modern TSDBs grow on all three axes because of the rate of ingestion and exploration. 💥

So, how do you build reliability tooling across org tiers? We’ve classified organizations into three types based on the following:

  • 👩‍💻 Engineers
  • 😍 Customers
  • 📀 Services

Based on this classification, it’s easy to understand needs across teams. From my conversations, a point stood out: Data querying needs to grow plentiful across teams as organizations mature. So, this is not an engineering endeavor only. Business, Finance, Product, and Customer Success teams have what it takes to bring your Prometheus down. And these have broad ramifications for any organization.

Without further adieu, here’s a link to the whitepaper: 👇

You can also get started by signing up here.


Hit us up on Discord 😍 with any queries about Levitate. You can also ping me on Twitter for brickbats, suggestions, and questions: @realmeson10.

Newsletter

Stay updated on the latest from Last9.

Authors

Piyush Verma

Co-Founder at Last9

Handcrafted Related Posts