Skip to content
Last9

Discover Hosts

Monitor and analyze infrastructure performance across all your hosts with comprehensive system metrics

The Hosts feature in Discover provides comprehensive infrastructure monitoring, delivering real-time visibility into system performance across all your hosts. Monitor CPU usage, memory consumption, storage capacity, and detailed system metrics to optimize resource allocation, identify performance bottlenecks, and maintain healthy infrastructure across your entire environment.

Host Detail Overview

This infrastructure monitoring solution helps you proactively identify resource constraints, track system health trends, and ensure optimal host performance for your applications and services.

Prerequisites

To monitor your host infrastructure with Last9, you need to configure at least one of the following data collection integrations:

Required (Choose at least one):

  • Host Metrics: Core system metrics collection for CPU, memory, disk, and network monitoring. Configure the Host Metrics integration to collect infrastructure metrics via OpenTelemetry collectors
  • Kubernetes Operator (recommended for Kubernetes deployments): Comprehensive Kubernetes monitoring including host-level metrics. Configure the Kubernetes Operator for Kubernetes environments
  • Kubernetes Cluster Monitoring: Alternative Kubernetes monitoring solution that includes host metrics collection. Set up Kubernetes Cluster Monitoring for cluster-wide infrastructure monitoring

You can use any combination of these integrations based on your infrastructure setup. For Kubernetes environments, the Kubernetes Operator is the recommended choice as it provides the most comprehensive monitoring capabilities.

Understanding the Hosts Dashboard

Access the Hosts dashboard at Discover > Hosts in Last9.

Hosts Overview

The Hosts dashboard displays all monitored infrastructure in a unified table with key performance indicators at a glance:

  • Host ID: Unique identifier for each monitored host
  • Host IP: Network address of the host
  • Job: Associated collection job
  • Uptime: How long the host has been running
  • CPU: Current CPU utilization with visual indicators
  • Memory: RAM usage showing used/total capacity
  • Root Volume: Primary disk usage percentage

Use the filtering capabilities to focus on specific hosts:

  1. Click on any column header to sort hosts by that metric
  2. Use the search box to filter by host ID or IP address
  3. Select multiple hosts using the checkboxes for bulk analysis
  4. Toggle between “ALL” and “NONE” to quickly select or deselect all hosts

Color-coded metrics help identify hosts requiring attention - green indicates normal operation while red suggests potential issues that need investigation.

Analyzing Individual Hosts

Click on any host to access comprehensive performance data and system analysis.

Host Detail Overview

Overview

The Overview tab provides high-level resource utilization dashboards with essential system metrics:

  • Resource Summary: View current CPU utilization, memory consumption, root volume usage, and network throughput at a glance
  • Performance Charts: Track CPU usage, memory consumption, and storage device usage over time with detailed graphs
  • Host Metadata: Essential configuration details including Host IP, uptime duration, instance type, container information, availability zone, and system architecture

Metrics

The Metrics tab offers comprehensive system performance analytics with detailed monitoring capabilities:

Host Detail Metrics

Core System Metrics:

  • CPU Usage: Processor utilization tracking over time for performance optimization
  • Memory Usage: RAM consumption patterns with available memory monitoring for capacity planning
  • Storage Device Usage: Disk utilization for mounted volumes and storage performance analysis
  • Network Bandwidth Usage: Network I/O rates and throughput monitoring for connectivity analysis

Advanced System Metrics:

  • System Load: System load averages indicating overall system stress and resource demand
  • Disk R/W Data: Read/write operations and throughput rates for storage performance optimization
  • Disk R/W Time: I/O operation latency and timing analysis for identifying storage bottlenecks
  • Disk IOps Completed: Input/output operations per second for storage performance monitoring
  • Time Spent Doing I/Os: Time spent on disk operations for I/O efficiency analysis
  • Network Sockstat: Network socket statistics and connection monitoring for network health
  • Open File Descriptor/Context Switches: System-level resource usage for process management analysis

Best Practices

Infrastructure Monitoring Strategy:

  • Regularly review host performance to identify trends and potential capacity issues before they impact applications
  • Monitor both individual host metrics and overall infrastructure health patterns
  • Use color-coded indicators to quickly identify hosts requiring immediate attention
  • Set up systematic monitoring schedules to track infrastructure health over time

Resource Optimization:

  • Use historical CPU and memory data to plan for capacity expansion and optimize resource allocation
  • Monitor disk I/O patterns to identify storage bottlenecks and optimize disk usage
  • Track network bandwidth utilization to plan for network capacity and identify connectivity issues
  • Analyze system load trends to understand resource demand patterns and optimize workload distribution

Performance Analysis:

  • Establish baseline performance ranges for your hosts to quickly identify anomalies and performance degradation
  • Monitor advanced metrics like file descriptor usage and context switches to identify system-level bottlenecks
  • Use storage device metrics to optimize disk allocation and identify potential hardware issues
  • Correlate network metrics with application performance to understand infrastructure impact on service delivery

Troubleshooting Workflow:

  • Start with the Overview tab to identify resource utilization anomalies and system health issues
  • Use the Metrics tab for detailed performance analysis and trend identification
  • Monitor system load and I/O metrics to identify infrastructure bottlenecks
  • Analyze network statistics to understand connectivity and throughput issues

Troubleshooting

Please get in touch with us on Discord or Email if you have any questions.