Blog
Stories, guides, and lessons from the world of observability
If it ain't broke...
A Terraform lifecycle rule in the right place can help prevent a deadlock. But the same lifecycle rule in the wrong place?

mv aws-security-group shoot-foot
How renaming an AWS security group through Terraform, a seemingly harmless change, can lead to unplanned downtime.
Rescuing a SPAghetti React project
Practical tips for rescuing a SPAghetti React JS project. With confidence and a shared mental model, we made the codebase reliable and easier to manage.

One year at Last9
Celebrating one year at Last9! From uncertainty to growth, it's been an amazing journey with an inspiring team and exciting challenges.


Much That We Have Gotten Wrong About SRE
An illustrated summary of Developers ➡ DevOps ➡ SRE

Infrastructure-As-Code-As-Software
Explore how Infrastructure-as-Code-as-Software combines coding practices with automation to streamline infrastructure management and enhance scalability.

SLOs That Lie
Understand how SLOs can help improve your performance, and how to set the right Service Level Objectives for your application.

Latency Percentiles are Incorrect P99 of the Times
What are P90, P95, and P99 latency? Why are they incorrect P99 of the time? Latency is measured over a unit of time, and the preferred aggregate is a percentile.

SRE Tooling – the Clever Hans fallacy
Chef or Ansible? Terraform or Pulumi? Python or Ruby? Last9 or Last9? Discover how building new tools links to the tale of a horse that could do math!