Skip to content

Data Collection


Guance supports collecting performance data and operational status information from hosts, cloud hosts, containers, and other resources to help monitor infrastructure. The collected data includes core metrics such as CPU, memory, disk, and network. After processing, the collected data is displayed through a visual interface, making it easy to grasp the operational status of infrastructure in real-time.

Prerequisites

Install DataKit.

Configuration

HOST

After installing DataKit, a set of host-related collectors are automatically enabled, and data is automatically reported to the workspace.

The list of collectors enabled by default is as follows:

Collector Name Description
cpu Host CPU usage
disk Disk usage
diskio Host disk IO status
mem Host memory usage
swap Swap memory usage
system Host operating system load
net Host network traffic status
host_process List of processes surviving more than 10min on the host
hostobject Basic host information (operating system, hardware information, etc.)
container Host container or Kubernetes data (if the host has no containers, the collector automatically exits)
Note
  • After host collection starts, modifying the host name host_name will add a new host record. The original host name is still retained and removed after 24 hours without data reporting.

  • DataKit counts the maximum number within a 24-hour period for billing. Therefore, modifying the host name within a billing cycle will result in charges for two hosts.

For more details, refer to DataKit Collector Usage, Host Object.

Cloud Hosts

If DataKit is deployed on a cloud host, you can enable cloud synchronization by setting the cloud_provider tag. After configuration, restart DataKit.

For more details, refer to Enable Cloud Synchronization.

CONTAINERS

Guance provides two ways to enable container data collection:

  1. Install DataKit on the host: Enable the Container collector to collect Containers and Pods data.
  2. Install DataKit using DaemonSet: Supports collecting full data from container components (Containers, Pods, Services, etc.).

Processes

The process collector is enabled by default but does not collect process metric data. To collect metric data, navigate to the conf.d/host folder in the DataKit installation directory, copy host_processes.conf.sample and rename it to host_processes.conf. Set open_metric to true and restart DataKit.

For more details, refer to Processes.

Resource Catalog

Create custom resources, associate them with the DataKit API and DataFlux Func, and report any data to Guance.

For specific steps, refer to Resource Catalog Data Reporting.

Feedback

Is this page helpful? ×