All Topics / software engineering
This arctic winter — time to repay your tech debt
We're in a peak tech winter. What should engineering teams focus on when product velocity dwindles?
A case for Observability outside engineering teams
Observability is being built by engineers for engineers. In reality, o11y is for all.
Observability is a practice, not a job
Engineering organizations that ship fast have Observability as part of their core DNA.
SRE vs Platform Engineering
What's the difference between SREs and Platform Engineers? How do they differ in their daily tasks?
SRE vs DevOps
What's the difference between SREs and DevOps professionals? How do they differ in their daily tasks?
High Cardinality for Dummies: ELI5
High Cardinality woes are far & frequent in today's modern cloud-native environment. What does it mean, & why is it such a pressing problem?
Mohan Dutt Parashar
Who should define Reliability — Engineering, or Product?
Whoever owns Reliability should define its parameters. But who owns the Reliability of a Product? Engineering? Product Management? Or the Customer success team?
What do self-driving cars tell us about Site Reliability Engineering?
From Robocars to Reliability — SRE with self-driving cars; mapping out where the Observability space is in conjunction with self-driving cars
Mohan Dutt Parashar
What is MTBI?
Everything you need to know about Mean Time Between Incidents (MTBI) and how it can help Site Reliability Engineers
SLA vs SLO vs SLI - What's the difference
What's the difference between SLAs vs SLOs vs SLIs. Understanding these little nuances are critical for DevOps folks. Here's a simple reckoner on what each of these mean
The importance of structured communication in the world of SRE
How you communicate helps build your 9s. In the world of Site Reliability Engineering, this is crucial. How do you do it?
The difference between DevOps, SRE, and Platform Engineering
In reliability engineering, three concepts keep getting talked about - DevOps, SRE and Platform Engineering. How do they differ?
Introduction to DORA Metrics
DORA metrics, what they are, why they are important, and best practices for measuring them.
Battling Alert Fatigue
What is Alert Fatigue and techniques to reduce it
Challenges of Distributed Tracing
What are the challenges, benefits and use cases of distributed tracing?
Latency is the new downtime
In the early days of Google, a lot of users were asking for 30 results on the first page of search results. So after long deliberation, Marissa Mayer, then the Product Manager for google.com, decided to run the A/B test for ten vs 30 results. When the results came in, they were in for a surprise.
We’ve raised a $11M Series A led by Sequoia Capital India!
Change is the only constant in a cloud environment. The number of microservices is constantly growing, and each is being deployed several times a day or week, all hosted on ephemeral servers. A typical customer request depends on at least three internal and one external service. It’s a densely connected web of systems. Any change in such a connected system usually introduces a ripple. It’s tough to understand these impacts. Alert fatigue, tribal knowledge of failures, and manual correlation acro
How to Improve On-Call Experience!
Better practices and tools for management of on-call practices
Running a Database on EC2 is Slowing It Down
Learn everything about the advantages of EC2, it's use cases and how to optimize EC2 further.
Jayesh Bapu Ahire, Akshay Chugh
The most interesting talks from SRECon 2021!
SRECon is a conference hosted by USENIX and is focused on site reliability, distributed systems, and systems engineering at scale. Learn about some of the most interesting talks from SRECon 2021.
Microservices - Tracking Dependencies
Quick primer into microservices architecture and the importance of tracking dependencies
Akshay Chugh, Jayesh Bapu Ahire
Monorepos - The Good, Bad, and Ugly
A monorepo is a single version control repository that holds all the code, configuration files, and components required for your project (including services like search) and it’s how most projects start. However, as a project grows, there is debate as to whether the project's code should be split into multiple repositories. In many cases, monorepos are still useful since they are very effective at managing projects with a lot of individual components. They also ensure that anyone working on a p