devops

DevOps Automation. Practices.

Everything in software monitoring is dead, apparently

Chasing shiny new toys, as always ;)

Aniket Rao

A checklist to choose a monitoring system

A detailed checklist of points you should consider before choosing a monitoring system

Prathamesh Sonpatki

Why you need a Time Series Data Warehouse

What is a Time Series Data Warehouse? How does it help in your monitoring journey? How does it differ from a Time Series Database? That and more

Rishi Agrawal

SRE vs DevOps

What's the difference between SREs and DevOps professionals? How do they differ in their daily tasks?

Last9

MTTF vs MTBF vs MTTD vs MTTR

This article covers what MTTF, MTBF, MTTD, and MTTR are, how they differ, how to adopt them, and their use cases.

Last9

Observability is dead, long live observability

No tool can magically offer you 99.999s. Observability is largely about the basics. And basics are boring. But, boring is hard. Boring is battle tested.

Aniket Rao

Introduction to DORA Metrics

DORA metrics, what they are, why they are important, and best practices for measuring them.

Prathamesh Sonpatki

Challenges of Distributed Tracing

What are the challenges, benefits and use cases of distributed tracing?

Last9

How to restart Kubernetes Pods with kubectl

This question keeps popping up, so here is a simple ready reckoner on how to restart a Kubernetes Pod with kubectl.

Last9

How to calculate HTTP content-length metrics on cli

A simple guide to crunch numbers for understanding overall HTTP content length metrics.

Saurabh Hirani

How to Improve On-Call Experience!

Better practices and tools for managing on-call.

Prathamesh Sonpatki

Getting the big picture with Log Analysis

How to get the most out of your logs!

Jayesh Bapu Ahire

Microservices - Tracking Dependencies

A quick primer on microservices architecture and the importance of tracking dependencies.

Akshay Chugh, Jayesh Bapu Ahire

SLOs eased

You can either love running or hate running, but you will definitely love this analogy - take a fresh look at SLOs!

Piyush Verma, Saurabh Hirani

If it ain't broke...

A Terraform lifecycle rule in the right place can help prevent a deadlock. But the same lifecycle rule in the wrong place?

Saurabh Hirani

mv aws-security-group shoot-foot

How can a seemingly harmless change, renaming an AWS security group through Terraform, lead to unplanned downtime?

Saurabh Hirani

Infrastructure-As-Code-As-Software

We ran a poll on Twitter and on Reddit: “Do you care about the quality of your infrastructure code?” The result was an approximate, and staggering, 60–30–10 split. What do you think the response would be if the poll asked, “Do you care about the quality of your product code?” We then asked a follow-up question to find out why ~30% fall in the “Somewhat but mostly no” category, and gleaned these reasons from Twitter and Reddit: 1. Someone manually created the legacy infrastructure. No one questioned t

Piyush Verma

SRE Tooling – the Clever Hans fallacy

Chef or Ansible? Terraform or Pulumi? Python or Ruby? Last9 or Last9? What if we told you that the mindset of building new tools has an age-old link to the story of a horse that could do arithmetic?

Piyush Verma