All Topics / devops
DevOps Automation. Practices.
A checklist to choose a monitoring system
A detailed checklist of points you should consider before choosing a monitoring system
Why you need a Time Series Data Warehouse
What is a Time Series Data Warehouse? How does it help in your monitoring journey? How does it differ from a Time Series Database? That and more
SRE vs DevOps
What's the difference between SREs and DevOps professionals? How do they differ in their daily tasks?
MTTF vs MTBF vs MTTD vs MTTR
This article covers questions such as what are MTTF, MTBF, MTTD, and MTTR, their differences, how to adopt them, and their use cases.
Observability is dead, long live observability
No tool can magically offer you 99.999s. Observability is largely about the basics. And basics are boring. But, boring is hard. Boring is battle tested.
Introduction to DORA Metrics
DORA metrics, what they are, why they are important, and best practices for measuring them.
Challenges of Distributed Tracing
What are the challenges, benefits and use cases of distributed tracing?
How to restart Kubernetes Pods with kubectl
A query that keeps popping up, so decided to write a simple reckoner on how to restart a Kubernetes pod with kubectl
How to calculate HTTP content-length metrics on cli
A simple guide to crunch numbers for understanding overall HTTP content length metrics.
How to Improve On-Call Experience!
Better practices and tools for management of on-call practices
Getting the big picture with Log Analysis
How to get the most out of your logs!
Jayesh Bapu Ahire
Microservices - Tracking Dependencies
Quick primer into microservices architecture and the importance of tracking dependencies
Akshay Chugh, Jayesh Bapu Ahire
You can either love running or hate running, but you will definitely love this analogy - take a fresh look at SLOs!
Piyush Verma, Saurabh Hirani
If it ain't broke...
A Terraform lifecycle rule in the right place can help prevent a deadlock. But the same lifecycle rule in the wrong place?
mv aws-security-group shoot-foot
How you can run into an unplanned downtime while making a seemingly harmless change of renaming an AWS security group through Terraform?
We ran a poll on Twitter. “Do you care about the quality of your infrastructure code?” And on Reddit That’s an approximate and staggering 60–30–10 split. What do you think will the response be if the poll was — “Do you care about the quality of your product code?” Reasons We asked a follow-up question to reason why ~30% are in the Somewhat but mostly no category and gleaned these reasons from Twitter and Reddit: 1. Someone manually created the legacy infrastructure. No one questioned t
SRE Tooling – the Clever Hans fallacy
Chef or Ansible? Terraform or Pulumi? Python or Ruby? Last9 or Last9? What if we told you that the mindset of building new tools has an age old link to the story of a horse who could do arithmetic?