Data Collection¶
Guance supports collecting performance data and operational status information from hosts, cloud hosts, containers, and other resources to help monitor infrastructure. The collected data includes core metrics such as CPU, memory, disk, and network. After processing, the collected data is displayed through a visual interface, making it easy to grasp the operational status of infrastructure in real-time.
Prerequisites¶
Configuration¶
HOST¶
After installing DataKit, a set of host-related collectors are automatically enabled, and data is automatically reported to the workspace.
The list of collectors enabled by default is as follows:
| Collector Name | Description |
|---|---|
cpu |
Host CPU usage |
disk |
Disk usage |
diskio |
Host disk IO status |
mem |
Host memory usage |
swap |
Swap memory usage |
system |
Host operating system load |
net |
Host network traffic status |
host_process |
List of processes surviving more than 10min on the host |
hostobject |
Basic host information (operating system, hardware information, etc.) |
container |
Host container or Kubernetes data (if the host has no containers, the collector automatically exits) |
Note
-
After host collection starts, modifying the host name
host_namewill add a new host record. The original host name is still retained and removed after 24 hours without data reporting. -
DataKit counts the maximum number within a 24-hour period for billing. Therefore, modifying the host name within a billing cycle will result in charges for two hosts.
For more details, refer to DataKit Collector Usage, Host Object.
Cloud Hosts¶
If DataKit is deployed on a cloud host, you can enable cloud synchronization by setting the cloud_provider tag. After configuration, restart DataKit.
For more details, refer to Enable Cloud Synchronization.
CONTAINERS¶
Guance provides two ways to enable container data collection:
- Install DataKit on the host: Enable the Container collector to collect Containers and Pods data.
- Install DataKit using DaemonSet: Supports collecting full data from container components (Containers, Pods, Services, etc.).
Processes¶
The process collector is enabled by default but does not collect process metric data. To collect metric data, navigate to the conf.d/host folder in the DataKit installation directory, copy host_processes.conf.sample and rename it to host_processes.conf. Set open_metric to true and restart DataKit.
For more details, refer to Processes.
Resource Catalog¶
Create custom resources, associate them with the DataKit API and DataFlux Func, and report any data to Guance.
For specific steps, refer to Resource Catalog Data Reporting.