Last9 engineering

All articles tagged 'Last9 engineering'

Extracting Account-Level CDN Metrics from Akamai Logs with Last9: A Practical Guide

Extracting Account-Level CDN Metrics from Akamai Logs with Last9

Learn how to extract and analyze account-level CDN metrics from Akamai logs using Last9 for real-time insights and better customer tracking.

Prathamesh Sonpatki

Aditya Godbole

Last9’s Single Pane for High Cardinality Observability

Last9’s Telemetry Warehouse now supports Logs and Traces, offering a unified view for high cardinality observability to simplify monitoring and troubleshooting.

Sahil Khan

Think Data Warehouse, NOT Database.

The software monitoring world is broken because of the TSDB. We deserve a TSDW.

Aniket Rao

What needs to change in software monitoring?

A wishlist of things that need to change in the world of software monitoring

Aniket Rao

Back to the Future: The R-C-A of alerting

Dissecting the RCA of Alerting - Reliability, Correlations, Actionability

Aditya Godbole

Software Monitoring — Stuck in the 00s

A short history of software monitoring, from the 00s. What has changed? Why are things so arcane?

Piyush Verma

Why your monitoring costs are high and how you can reduce them with Levitate

Why your monitoring costs are high

If you want to bring down your monitoring costs, you need to break the decision paralysis in engineering.

Aniket Rao

Deliver all your orders this December 31st 😉

The unresolved cost of High Cardinality

Fulfill all your food delivery orders this December 31st by taming High Cardinality data with Levitate 😉

Prathamesh Sonpatki

A Time Series Data Warehouse vs A Time Series Database

Why you need a Time Series Data Warehouse

What is a Time Series Data Warehouse? How does it help in your monitoring journey? How does it differ from a Time Series Database? That and more

Rishi Agrawal

Real-Time Canary Deployment Tracking with Argo CD & Levitate

Real-Time Canary Deployment Tracking with Argo CD & Last9

Use Levitate's change events to track the success of canary rollouts via Argo CD

Preeti Dewani

Monitor Google Cloud Functions using Pushgateway and Levitate

How to monitor serverless async jobs from Google Cloud Functions with Prometheus Pushgateway and Levitate using the push model

Aniket Rao

Golang Concurrency Masterclass by Swati Modi at Gophercon 2023

Talk on Golang Concurrency Masterclass by Swati Modi at Gophercon 2023

Last9

Do more with your metrics by Piyush Verma at GopherConIndia 2022

Do more with your metrics by Piyush Verma

Piyush Verma's talk at GopherCon India 2022 on Do More with Your Metrics with Last9 and Levitate

Last9

Unwiring High Cardinality - SRE Day 2023

Report from SRE Day 2023, where Piyush Verma - CTO Last9, gave a talk on Unwiring High Cardinality

Last9

How to restart Kubernetes Pods with kubectl

A simple reckoner on how to restart a Kubernetes pod with kubectl

Anjali Udasi

Levitate: Last9’s Managed TSDB Now on AWS Marketplace

Levitate - Last9's managed Prometheus Compatible TSDB is available on AWS Marketplace

Prathamesh Sonpatki

Standardize PromQL with Macros

PromQL Macros in Levitate

Define PromQL Macros to standardize complex PromQL queries in Levitate

Prathamesh Sonpatki

GCP Managed Service For Prometheus vs. Levitate

A detailed comparison of Levitate and Google Managed Prometheus - Cost, Scale and Ease of Use

Prathamesh Sonpatki

A case for Observability outside engineering teams

Observability is being built by engineers for engineers. In reality, o11y is for all.

Aniket Rao

Understanding the Rasmussen model for failures

What does the Rasmussen model teach us about Site Reliability Engineering?

Nishant Modak

How we tame High Cardinality by Sharding a stream

Using 'Sharding' to tame High Cardinality data for Levitate - Our Time Series Data Warehouse

Piyush Verma

1979, a nuclear accident and SRE

Deep diving into the 'Normal accident' theory by Charles Perrow, and what it means for SREs

Aniket Rao

How we tame high cardinality in time series databases: Part 1

How we tame high cardinality in time series databases

Engineering innovation to solve high cardinality with Levitate - a multi-part series

Piyush Verma

Swati Modi

What Site Reliability Engineering needs — A swarm of rogue bees

What Site Reliability Engineering Needs: A Swarm of Bees

If all companies are software companies, all companies need better Observability to understand how performant their software is

Aniket Rao

Take back control of your Monitoring with Levitate

Take back control of your Monitoring

Take back control of your Monitoring with Levitate - a managed time series data warehouse

Nishant Modak

Observability is a practice, not a job

Engineering organizations that ship fast have Observability as part of their core DNA.

Aniket Rao

Using a Golang package in Python using Gopy

Using a Golang package in Python with Gopy: a simple way to leverage the power of Golang packages in Python applications.

Arjun Mahishi

Who should define Reliability — Engineering, or Product?

Whoever owns Reliability should define its parameters. But who owns the Reliability of a Product? Engineering? Product Management? Or the Customer success team?

Piyush Verma

OSS vs Paid vs Managed OSS — Picking what works for your Observability journey

Observability—OSS vs Paid vs Managed OSS

The Reliability industry needs a managed answer, free of vendor lock-in, to spiraling costs, high cardinality, and the toil of managing a TSDB

Satyajeet Jadhav

Learnings integrating jmxtrans with Levitate

Learnings integrating jmxtrans

JMX metrics give solid insights into the workings of your application. Integrating them with Levitate (our time series data warehouse) required us to jump through some hoops with vmagent.

Saurabh Hirani

The neglected tech arctic winter — Internal SaaS expenses

The current tech winter reveals a hard truth: spending on internal tools for tech infrastructure is bloated—and this isn't just a passing cycle.

Nishant Modak

What does "Cricket scale" mean for a Site Reliability Engineer?

Understanding “Cricket Scale”

How does a DevOps/Site Reliability Engineer plan for "Cricket scale"? How do you warm up systems that are about to witness 30+ million concurrent users?

Aniket Rao

What is MTBI?

Everything you need to know about Mean Time Between Incidents (MTBI) and how it can help Site Reliability Engineers

Last9

Do your alerting tools improve outcomes for Business?

Rethinking Anomaly Detection: Focus on business outcomes

From the trenches at Games24x7 — Sanjay, on how Reliability engineering should drive core business metrics

Sanjay Singh

A good chunk of SRE woes can be traced back to the stronghold of tribal knowledge across teams 😵‍💫

Observability is dead, long live observability

No tool can magically offer you 99.999s. Observability is largely about the basics. And basics are boring. But, boring is hard. Boring is battle tested.

Aniket Rao

Self-managed Prometheus vs Managed Prometheus

What are the differences between self-managed Prometheus and managed Prometheus? How do you choose what works for you?

Last9

The importance of structured communication in the world of SRE

How you communicate helps build your 9s. In the world of Site Reliability Engineering, this is crucial. How do you do it?

Saurabh Hirani

The difference between DevOps, SRE, and Platform Engineering

In reliability engineering, three concepts keep getting talked about - DevOps, SRE and Platform Engineering. How do they differ?

Prathamesh Sonpatki

Golang's Stringer tool

Learn how to use and extend Golang's Stringer tool to auto-generate String() methods

Arjun Mahishi

How to improve Prometheus remote write performance at scale

Deep dive into how to improve the performance of Prometheus Remote Write at Scale based on real-life experiences

Saurabh Hirani

India vs Pakistan: SRE and the Shannon Limit

How does one ‘detect change’ in a complex infrastructure, so you don’t lose out on critical revenues — A short SRE story

Satyajeet Jadhav

Battling Alert Fatigue

What Alert Fatigue is, and techniques to reduce it

Last9

Guide to Service Level Indicators and Setting Service Level Objectives

SLOs, SLIs, and SLAs: Understanding Key Service Metrics

A guide to setting practical Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for your Site Reliability Engineering practices.

Last9

Kubernetes Monitoring with Prometheus and Grafana

A guide to help you implement Prometheus and Grafana in your Kubernetes cluster

Last9

Why We Auto-Delete Slack Messages at Last9

At Last9, we auto-delete Slack DMs after 2 days. This pushes teams to improve documentation, reduce tribal knowledge, and own accountability.

Nishant Modak

Static Threshold vs. Dynamic Threshold Alerting

What's the difference between Static Threshold vs Dynamic Threshold Alerting? Do you really know when and how to use each threshold type?

Last9

How we won Dukaan over

5 meetings. 1 month. Subhash and his team’s velocity on decision-making, moving fast, and radical candor are a breath of fresh air in the Indian startup ecosystem.

Aniket Rao

How to calculate HTTP content-length metrics on cli

A simple guide to crunch numbers for understanding overall HTTP content length metrics.

Saurabh Hirani

Choosing Effective SLIs

Practical advice to choose an effective SLI.

Akshay Chugh

Running a Database on EC2 is Slowing It Down

Learn everything about the advantages of EC2, its use cases, and how to optimize EC2 further.

Jayesh Bapu Ahire

Akshay Chugh

Doing SRE the Right Way!

A well-thought-out approach to SRE, which will help site reliability engineers and software engineers develop and maintain a useful, consistent, and effective SRE strategy for their products!

Piyush Verma

Microservices - Tracking Dependencies

Quick primer into microservices architecture and the importance of tracking dependencies

Akshay Chugh

Jayesh Bapu Ahire

SLOs eased

You can either love running or hate running, but you will definitely love this analogy - take a fresh look at SLOs!

Piyush Verma

Saurabh Hirani

Rescuing a SPAghetti React project

Practical tips for rescuing a SPAghetti React JS project. With confidence and a shared mental model, we made the codebase reliable and easier to manage.

Prathamesh Sonpatki

One year at Last9

Celebrating one year at Last9! From uncertainty to growth, it's been an amazing journey with an inspiring team and exciting challenges.

Prathamesh Sonpatki