monitoring
![Think Data Warehouse, NOT Database.](https://last9.ghost.io/content/images/2024/07/Data-Retention-and-Management-2.jpg)
Think Data Warehouse, NOT Database.
The software monitoring world is broken because of a TSDB. We deserve a TSDW.
Aniket Rao
![Building monitoring by auto discovering resources for 70+ microservices](https://last9.ghost.io/content/images/2024/06/Auto-Discovery-tool-story.jpg)
Building monitoring by auto discovering resources for 70+ microservices
The promise of a managed SaaS partner — Reducing monitoring costs at all costs
Preeti Dewani
![What needs to change in software monitoring?](https://last9.ghost.io/content/images/2024/06/What-needs-to-change-in-Software-monitoring.jpg)
What needs to change in software monitoring?
A wishlist of things that need to change in the world of software monitoring
Aniket Rao
![How we reduced monitoring costs and deprecated Thanos for Replit](https://last9.ghost.io/content/images/2024/06/blog-5.png)
How we reduced monitoring costs and deprecated Thanos for Replit
Winning Replit over by taming High Cardinality data and deprecating Thanos
Prathamesh Sonpatki
![Software Monitoring — Stuck in the 00s](https://last9.ghost.io/content/images/2024/03/Software-monitoring.jpg)
Software Monitoring — Stuck in the 00s
A short history of software monitoring, from the 00s. What has changed? Why are things so arcane?
Piyush Verma
![A checklist to choose a monitoring system](https://last9.ghost.io/content/images/2024/02/A-checklist-to-choose-a-monitoring-system.jpg)
A checklist to choose a monitoring system
A detailed checklist of points you should consider before choosing a monitoring system
Prathamesh Sonpatki
![Controlling Kubernetes Costs with OpenCost and Levitate](https://last9.ghost.io/content/images/2024/02/opencost-with-levitate--1-.png)
Controlling Kubernetes Costs with OpenCost and Levitate
Setting up OpenCost with Levitate to monitor the cost of Kubernetes clusters
Aniket Rao
![Why your monitoring costs are high](https://last9.ghost.io/content/images/2024/01/Why-your-monitoring-costs-are-high.jpg)
Why your monitoring costs are high
If you want to bring down your monitoring costs, you need to shake up a decision paralysis in engineering
Aniket Rao
![Prometheus Metrics Types - A Deep Dive](https://last9.ghost.io/content/images/2023/12/prathamesh_s_a_small_kid_looking_at_the_large_map_intricately_g_d61f9f70-280a-498d-88ea-3ca87b17db8e--1-.png)
Prometheus Metrics Types - A Deep Dive
A deep dive into the different metric types in Prometheus and best practices for using them
Tripad Mishra
![Monitor Cloudflare Workers using Prometheus Exporter](https://last9.ghost.io/content/images/2023/12/prathamesh_s_clouds_on_a_sunny_new_york_morning_and_a_young_kid_c45ec021-ea9f-4da1-b488-30542253f337--1-.png)
Monitor Cloudflare Workers using Prometheus Exporter
A complete guide to monitoring Cloudflare Workers using Prometheus Exporter
Aniket Rao
![Why you need a Time Series Data Warehouse](https://last9.ghost.io/content/images/2023/12/Why-you-need-a-Time-Series-Data-Warehouse---blog-article.jpg)
Why you need a Time Series Data Warehouse
What is a Time Series Data Warehouse? How does it help in your monitoring journey? How does it differ from a Time Series Database? That and more
Rishi Agrawal
![How To Instrument Golang app using OpenTelemetry - Tutorial & Best Practices](https://last9.ghost.io/content/images/2023/11/rassmussen_an_ant_observing_from_the_scope_of_a_large_telescope_13da3f48-359a-434a-babc-061dbc014a13--1-.png)
How To Instrument Golang app using OpenTelemetry - Tutorial & Best Practices
A comprehensive guide to instrumenting Golang applications using OpenTelemetry libraries for metrics and traces
Last9
![Building Logs to Metrics pipelines with Vector](https://last9.ghost.io/content/images/2023/11/vector_logs_to_metrics--1---1-.png)
Building Logs to Metrics pipelines with Vector
How to build a pipeline to convert logs to metrics and ship them to long-term storage like Levitate
Aniket Rao
![SaaS Monitoring with Levitate](https://last9.ghost.io/content/images/2023/11/saas_monitoring_with_levitate--1-.png)
SaaS Monitoring with Levitate
How Levitate solves today's challenges of B2B SaaS monitoring, including noisy neighbors, by unlocking per-tenant observability
Prathamesh Sonpatki
![Troubleshooting Common Prometheus Pitfalls: Cardinality, Resource Utilization, and Storage Challenges](https://last9.ghost.io/content/images/2023/11/troubleshooting_prometheus_pitfalls--1-.png)
Troubleshooting Common Prometheus Pitfalls: Cardinality, Resource Utilization, and Storage Challenges
Common Prometheus pitfalls and ways to handle them
Last9
![Downsampling & Aggregating Metrics in Prometheus: Practical Strategies to Manage Cardinality and Query Performance](https://last9.ghost.io/content/images/2023/11/Prometheus-Downsampling-Metrics-Time-Series.jpg)
Downsampling & Aggregating Metrics in Prometheus: Practical Strategies to Manage Cardinality and Query Performance
A comprehensive guide to downsampling metrics data in Prometheus with alternate robust solutions
Last9
![Mastering Prometheus Relabeling: A Comprehensive Guide](https://last9.ghost.io/content/images/2023/11/How_relabeling_in_Prometheus_works.jpg)
Mastering Prometheus Relabeling: A Comprehensive Guide
A comprehensive guide to relabeling strategies in Prometheus
Last9
![Real-Time Canary Deployment Tracking with Argo CD & Levitate Change Events](https://last9.ghost.io/content/images/2023/11/Canary-deployment.png)
Real-Time Canary Deployment Tracking with Argo CD & Levitate Change Events
Use Levitate's powerful change events to track the success of canary rollouts via Argo CD
Preeti Dewani
![Monitor Google Cloud Functions using Pushgateway and Levitate](https://last9.ghost.io/content/images/2023/11/gcp-cloud-functions-to-levitate.jpg)
Monitor Google Cloud Functions using Pushgateway and Levitate
How to monitor serverless async jobs from Google Cloud Functions with Prometheus Pushgateway and Levitate using the push model
Aniket Rao
![Prometheus vs. ELK](https://last9.ghost.io/content/images/2023/11/prometheus_vs_elk--1-.png)
Prometheus vs. ELK
Comparison and differences between Prometheus and ELK
Last9
![What is Thanos and How Does it Scale Prometheus?](https://last9.ghost.io/content/images/2023/11/rassmussen_Ghibli_style_illustration_of_miniature_marvel_charac_1d7be56b-6480-4c28-beeb-849eabb81e94--1-.png)
What is Thanos and How Does it Scale Prometheus?
A guide to what Thanos is and how it can be used with Prometheus
Last9
![A case for Observability outside engineering teams](https://last9.ghost.io/content/images/2023/11/prathamesh_s_An_illustration_image_of_two_halves-1--1---1-.png)
A case for Observability outside engineering teams
Observability is being built by engineers for engineers. In reality, o11y is for all.
Aniket Rao
![Understanding the Rasmussen model for failures](https://last9.ghost.io/content/images/2023/11/Rasmussen--1-.png)
Understanding the Rasmussen model for failures
What does the Rasmussen model teach us about Site Reliability Engineering?
Nishant Modak
![Observability vs. Telemetry vs. Monitoring](https://last9.ghost.io/content/images/2023/08/Observability_vs_Telemetry_vs_Monitoring.png)
Observability vs. Telemetry vs. Monitoring
Observability vs Telemetry vs Monitoring - what they are, how they differ, and what lies ahead
Last9
![What is OpenTelemetry Collector](https://last9.ghost.io/content/images/2023/07/What-is-OpenTelemetry-Collector-1.png)
What is OpenTelemetry Collector
What the OpenTelemetry Collector is, its architecture, deployment, and getting started
Last9
![What is High Cardinality](https://last9.ghost.io/content/images/2023/06/what-is-high-cardinality.jpg)
What is High Cardinality
An overview of what high cardinality is in the context of monitoring using Prometheus and Grafana
Prathamesh Sonpatki
![What is OpenTelemetry](https://last9.ghost.io/content/images/2023/06/what-is-opentelemetry.jpg)
What is OpenTelemetry
Learn what OpenTelemetry is: the open-source observability framework for collecting and processing telemetry data from applications and systems.
Last9
![How to Manage High Cardinality Metrics in Prometheus](https://last9.ghost.io/content/images/2023/06/prometheus-high-cardinality.jpg)
How to Manage High Cardinality Metrics in Prometheus
A comprehensive guide to understanding high cardinality Prometheus metrics, with proven ways to find and manage them.
Last9
![Prometheus Operator Guide](https://last9.ghost.io/content/images/2023/06/Prometheus-Operator-Guide-copy.jpg)
Prometheus Operator Guide
What Prometheus Operator is and how it can be used to deploy the Prometheus stack in a Kubernetes environment
Last9
![Prometheus and Grafana](https://last9.ghost.io/content/images/2023/06/prometheus-grafana--1-.jpg)
Prometheus and Grafana
What Prometheus and Grafana are, what they are used for, and the difference between them.
Last9
![Understanding Metrics, Events, Logs and Traces - Key Pillars of Observability](https://last9.ghost.io/content/images/2023/04/melt.jpg)
Understanding Metrics, Events, Logs and Traces - Key Pillars of Observability
Understanding Metrics, Events, Logs and Traces - the key pillars of observability and their pros and cons for SRE and DevOps teams.
Prathamesh Sonpatki
![SRE vs Platform Engineering](https://last9.ghost.io/content/images/2023/07/SRE-vs-Platform-Engineering.png)
SRE vs Platform Engineering
What's the difference between SREs and Platform Engineers? How do they differ in their daily tasks?
Last9
![Prometheus vs Datadog](https://last9.ghost.io/content/images/2022/12/Prometheus-vs-Datadog.jpg)
Prometheus vs Datadog
Comparison between Prometheus and Datadog - two of the most popular monitoring tools in the market today
Last9
![What is Prometheus Remote Write](https://last9.ghost.io/content/images/2023/05/prom-remote-write.jpg)
What is Prometheus Remote Write
Learn what Prometheus Remote Write is and how to configure it.
Last9
![What is Prometheus](https://last9.ghost.io/content/images/2023/05/prometheus.jpg)
What is Prometheus
What Prometheus is, how to use it, and the challenges of scaling it
Last9
![Who should define Reliability — Engineering, or Product?](https://last9.ghost.io/content/images/2023/05/reliability-teams.jpg)
Who should define Reliability — Engineering, or Product?
Whoever owns Reliability should define its parameters. But who owns the Reliability of a Product? Engineering? Product Management? Or the Customer success team?
Piyush Verma
![Interesting talks on Observability from Fosdem 2023](https://last9.ghost.io/content/images/2023/02/photo-1503122703469-3dbbe39d0d1c.jpeg)
Interesting talks on Observability from Fosdem 2023
A recap of the talks from the Observability and Monitoring dev room at FOSDEM 2023.
Prathamesh Sonpatki
![Prometheus Monitoring](https://last9.ghost.io/content/images/2023/02/Prometheus-Monitoring.jpg)
Prometheus Monitoring
Prometheus is a popular open-source monitoring system. In this blog, we'll cover the basics of Prometheus monitoring, including its architecture, key features, and alternatives.
Last9
![When should I start thinking of observability?](https://last9.ghost.io/content/images/2023/02/180-04-45-26-8007.png)
When should I start thinking of observability?
How does one scale metrics maturity in a cloud-native world? A guide to observability tooling as your engineering org scales.
Piyush Verma
![India vs Pakistan, Site Reliability Engineering, and Shannon Limit](https://last9.ghost.io/content/images/2022/11/India-vs-Pakistan--Shannon-Limit--and-Site-Reliability-Engineering-copy-4.jpg)
India vs Pakistan, Site Reliability Engineering, and Shannon Limit
How does one ‘detect change’ in a complex infrastructure, so you don’t lose out on critical revenues — A short SRE story
Satyajeet Jadhav
![Kubernetes Monitoring with Prometheus and Grafana](https://last9.ghost.io/content/images/2022/11/kubernetes-promtheus-copy.jpg)
Kubernetes Monitoring with Prometheus and Grafana
A guide to help you implement Prometheus and Grafana in your Kubernetes cluster
Last9
![Static Threshold vs. Dynamic Threshold Alerting](https://last9.ghost.io/content/images/2022/10/Static-Threshold-vs.-Dynamic-Threshold-Alerting-copy.jpg)
Static Threshold vs. Dynamic Threshold Alerting
What's the difference between Static Threshold and Dynamic Threshold Alerting? Do you really know when and how to use each threshold type?
Last9
![Sample vs Metrics vs Cardinality](https://last9.ghost.io/content/images/2022/08/Cube-Creative.jpg)
Sample vs Metrics vs Cardinality
When dealing with Time Series databases, I always got confused by Sample vs Metrics vs Cardinality. Here’s an explanation as I have understood it.
Piyush Verma