Changelog¶
1.75.0 (2025/06/11)¶
This is an iterative release with the following key updates:
New Features¶
- Added time calibration function for data collection, where the collection time is based on DataWay time (#2664).
- Added
multipart/form-data
on HTTP dial-testing body (#2630). - Added election feature to the dial-testing collector (#2649).
Issue Fixes¶
- Fixed an issue where the SNMP collector could not exit normally in certain scenarios (#2705).
Function Optimizations¶
- Adjusted Kubernetes event change fields (#2716).
- Enhanced MySQL object collection (#2709).
- Enabled the statsd collector by default and optimized the Helm installation package (#2706).
1.74.2(2025/06/04)¶
This release is a hotfix, with the following updates:
Bug Fixes¶
- Fixed global tags missing on Kubernetes pod count metric (#2715)
1.74.1(2025/06/03)¶
This release is a hotfix, with the following updates:
Bug Fixes¶
- Fixed duplicated ClusterRole definition in Helm package (#2711)
- Add
.Values.datakit.cache.mountPath
to set cache mount path(default /root/datakit_cache) (#2707)
1.74.0 (2025/05/28)¶
This is an iterative release with the following key updates:
New Features¶
- Added PHP-FPM collector (#2608).
- Added object collection for MySQL (#2679).
Issue Fixes¶
- Fixed incomplete collection on DDTrace resource (#2600).
- Fixed election not working for Kafka/Jenkins/Jolokia collectors (#2688).
- Fixed bug introduced in v1.73 that force-modified existing resource limits during install/upgrade (#2695).
- Fixed the missing
cluster
tag in Kubernetes object counting and object changes (#2696). - Fixed data record/playback not working (#2699).
- Fixed bug on KubernetesPrometheus up metrics when endpoints are unavailable (#2704).
- Enabled support for
SHA224/SHA256/SHA384/SHA512
in SNMP (#2697).
Function Optimizations¶
- DDTrace added
out_host
field. - Added a unified HTTP header
Referer: DataKit
to DataKit requests to DataWay APIs (#2703). - Added new fields
age/capacity_storage/access_modes/requests_storage
for Kubernetes PV/PVC objects (#2667). - Optimized collection of container CPU/memory limit metrics in pure container environments (#2669).
- Improved handling of large logs for OpenTelemetry logging collection (#2671).
- Optimized log include/exclude collection strategies (#2672/#2686).
- Optimized Helm default configuration, removed KSM, and added an
ENV_CLUSTER_NAME_K8S
configuration entry (#2683). - Enhanced Oracle metric collection by adding metrics related to locked-sessions and waiting-events, and refactored Oracle dashboard (#2684).
- Added custom timeout configuration for the dialtesting collector (#2693).
- Other updates (#2702):
- Added best practices documentation for APM sampling (#2673).
- Updated a batch of dashboards (!3513/!3498/!3494).
1.73.0 (2025/05/14)¶
This release is an iterative release, and the main updates are as follows:
New Features¶
- Added
/v1/ntp
HTTP API, which is mainly used to verify the point timestamps on RUM data (#2666).
Issue Fixes¶
- Fix the HTTP header
User-Agent
of DataKit itself for HTTP dialtesting (#2662). - Fix APM automatic injection for Java subprocess (#2645).
- Fix container logs cannot be collected in the host installation mode introduced in version 1.72.0 (#2650).
- Fix ineffective DDTrace sampling introduced introduced in version 1.72.0 (#2670).
Function Optimizations¶
- Adjust the change event fields for Kubernetes objects (#2647).
- Add the
mount_point
field in the Windows disk collector (#2652). - Extract the database-related meta information to the first-level fields in the DDTrace/OpenTelemetry APM collector (#2635).
- The KV mechanism issued by the center adds support for Kubernetes Pod logs (configure KV in the annotation) and metric collection (configure KV in the KubernetesPrometheus collector conf) (#2635).
- Other adjustments (#2663/#2651/#2646/#2627/#2624/#2564/#2598/#2422).
1.72.0 (2025/04/23)¶
This release is an iterative update, with the following key changes:
New Features¶
- Added the Windows Remote collector, which supports remote collection of basic information from Windows hosts via SNMP/WMI (#2591).
- Added collection for major Kubernetes object changes (#2514).
- Added a RUM environment variable API to the DataKit HTTP API (#2622).
Bug Fixes¶
- Fixed an issue where the
LoggingSourceMultilineMap
configuration was not taking effect (#2625).
Function Optimizations¶
- In major database collectors, custom metric collection now supports setting independent collection intervals (#2604).
- In log collection, the default log level has been changed from
unknown
toinfo
(#2628). - Add environment to disable host cloud-meta sync (#2631)
- Other adjustments (#2564/#2603).
1.71.0 (Thu Apr 10 CST 2025)¶
This release is an iterative release, with the following main updates:
New Features¶
- Pyroscope added support for Rust (#2602).
Bug Fixes¶
- Fixed the issue that monitor may crash in some cases (#2610).
- Fixed the issue of APM automatic instrumentation failure (#2607).
- In the log collection of some collectors, the automatic segmentation of the log length was supplemented to avoid the loss of long logs (#2613).
- Fixed the issue that the custom tags of the dial test collector cannot be reported (#2616).
Function Optimization¶
- Added X-Pkg-ID to the HTTP header for data packet tracking (#2587).
- Added the
source_host/source_component
fields to the data collected from Kubernetes events (#2606). - In the DDTrace resource catalog collection, the user-defined tags were promoted to first-level fields to enable sinking (#2609).
- Optimized the DDTrace sampling strategy. For traces with sampler-keep, DataKit will no longer sample but directly retain them (#2614).
- The WAL disk cache allows some data categories not to discard data when the disk is full (#2620).
- Added global tags to Profile and RUM session replay data to enable sinking (#2621).
- The eBPF collection added more Kubernetes labels, such as
cronjob/daemonset/statefulset
, etc. (#2571). - Other optimizations (#2615).
1.70.0 (March 26, 2025)¶
This release is an iterative release, with the following main updates:
New Features¶
- KubernetesPrometheus added the up metric (#2577).
Bug Fixes¶
- Fixed the issue that the disable of Pod stdout log collection is ineffective in some cases (#2570).
- Optimized the handling of extremely long multi-line logs (exceeding the maximum allowed HTTP Post length) (#2572).
- Fixed the cross-origin issue when OpenTelemetry receives the trace data pushed by the front end (#2592).
- Fixed the issue of APM auto-injection failure (#2594).
- Fixed the issue that the disk collector fails to obtain the disk usage (#2597).
- Fixed the issue of the Zipkin collector handling
application/json; charset=utf-8
(#2599). - Fixed memory leak in lsblk collector (!3458)
Function Optimization¶
- The SQLServer collector added support for the 2008 version (#2584).
- The database collectors (MySQL/Oracle/PostgreSQL/SQLServer) added a metric disable feature (some metrics are not collected to relieve the pressure on the database caused by the collection itself). At the same time, metrics are recorded were made for the SQL execution during the collection process (#2579).
- User-defined tags were added to the custom objects collected by DDTrace (#2593).
- The NFS collector added read and write latency metrics (#2601).
1.69.1 (March 18, 2025)¶
This release is a hotfix, with the following updates:
Bug Fixes¶
- Fixed the issue of incorrect CPU collection for Docker containers (#2589).
- Fixed the memory leak problem caused by the script execution of the dial-testing collector (#2588).
- Optimized the error messages of the multi-step dial test (#2567).
- Other documentation update (#2590)
1.69.0 (March 12, 2025)¶
This release is an iterative release, with the following updates:
New Features¶
- APM auto instructions adds support for injecting statsd (#2573).
- Pipeline adds support for key event data (#2585).
Bug Fixes¶
- Fixed the issue that
host_ip
cannot be obtained after the host restarts (#2543).
Function Optimization¶
- Optimized the process collector and added several process related metrics (#2366).
- Optimized the processing of the trace-id field in DDTrace (#2569).
- Added the
base_service
field in OpenTelemetry collection (#2575). - Adjusted the default settings of WAL. The number of workers is defaulted to the CPU-limit cores * 8, and the number of workers and the disk cache size can be specified during the installation/upgrade stage (#2582).
- Removed the pid detection when DataKit runs in the container environment (#2586).
Compatibility Adjustments¶
-
Optimized the disk collector to ignore some file system types and mount points (#2566).
Adjusted the disk metric collection and updated the disk list collection in the host object. The main differences are as follows:
- Added the mount point ignore option: This adjustment is mainly to optimize the process of DataKit obtaining the disk list in Kubernetes, filtering out some unnecessary mount points, such as the ConfigMap configuration mount (
/usr/local/datakit/.*
) and the mount caused by Pod log collection (/run/containerd/.*
); meanwhile, it avoids the addition of invalid time series(these new time series are mainly caused by different mount points). - Added the file system ignore option: Some file systems that are not necessary to collect, such as
tmpfs/autofs/devpts/overlay/proc/squashfs
, etc., are default ignored. - In the host object collection, the same default ignore strategy as the disk metric collection is adopted.
After such adjustments, the number of time series can be greatly reduced. Meanwhile, when we configure monitoring, it is easier to understand and avoid the trouble caused by numerous mount points.
- Added the mount point ignore option: This adjustment is mainly to optimize the process of DataKit obtaining the disk list in Kubernetes, filtering out some unnecessary mount points, such as the ConfigMap configuration mount (
1.68.1 (February 28, 2025)¶
This release is a hotfix, the content is as follows:
Bug Fixes¶
- Fixed the memory consumption problem of OpenTelemetry metric collection (#2568).
- Fixed the crash problem caused by eBPF parsing the PostgreSQL protocol (!3420).
1.68.0 (February 27, 2025)¶
This release is an iterative release, with the following updates:
New Features¶
- Added the multi-step dial-test function (#2482).
Bug Fixes¶
- Fixed the problem of clearing the multi-line cache in log collection (!3419).
- Fixed the default configuration problem of xfsquota (!3419).
Function Optimization¶
- The Zabbix Exporter collector added compatibility with lower versions (v4.2+) (#2555).
- The
setopt()
function is provided to customize the processing of log levels when Pipeline processes logs (#2545). - When the OpenTelemetry collector collects histogram metrics, it is defaulted to convert them into Prometheus style histograms (#2556).
- Adjusted the CPU-limit method when installing DataKit on the host. The newly installed DataKit defaults to using the limit mechanism based on the number of CPU cores (#2557).
- The Proxy collector added the source IP whitelist mechanism (#2558).
- The collection of Kubernetes container and Pod metrics allows for targeted collection by namespace/image, etc. (#2562).
- The memory/CPU completion of Kubernetes containers and Pods is collected based on the percentage of Limit and Request (#2563).
- AWS cloud synchronization added IPv6 support (#2559).
- Other problem fixes (!3418/!3416).
Compatibility Adjustments¶
- When collecting OpenTelemetry metrics, the name of the measurement was adjusted. The original
otel-service
was changed tootel_service
(!3412).
1.67.0 (February 12, 2025)¶
This release is an iterative release, with the following updates:
New Features¶
- KubernetesPrometheus supports adding HTTP header settings during collection and, incidentally, supports configuring the bearer token in string form (#2554).
- Added the xfsquota collector (#2550).
- AWS cloud synchronization added IMDSv2 support (#2539).
- Added the Pyroscope collector for collecting Java/Golang/Python Profiling data based on Pyroscope (#2496).
Bug Fixes¶
Function Optimization¶
- Improved the documentation related to DCA configuration (#2553).
- OpenTelemetry collection supports extracting the event field as a first-level field (#2551).
- Improved the DDTrace Golang documentation and added instructions for compile time instrumentation (#2549).
1.66.2(2025/01/17)¶
This release is a hotfix update, with the following enhancements and fixes:
Bug Fixes¶
- Fixed Pipeline debug API compatible issue (!3392)
- Fixed UDS listen bug (#25344)
- Added
linux/arm64
support for UOS images (#2529) - Fixed prom v2 tag precedence bug (#2546) and Bearer Token bug (#2547)
1.66.1 (2025/01/10)¶
This release is a hotfix update, with the following enhancements and fixes:
Bug Fixes¶
- Fixed the timestamp precision issue in the prom v2 collector (#2540).
- Resolved the conflict between the PostgreSQL
index
tag and DQL keywords (#2537). - Fixed the missing
service_instance
field in SkyWalking collection (#2542). - Removed unnecessary configuration fields in OpenTelemetry and fixed the missing
unit
tags for some metrics (#2541).
1.66.0 (2025/01/08)¶
This release is an iterative release. The main updates are as follows:
New Features¶
- Added KV mechanism to support pulling updates for collector configurations (#2449)
- Added AWS/Huawei Cloud object storage support for remote job (#2475)
- Added new NFS collector (#2499)
- The test data for the Pipeline debugging API supports more HTTP
Content-Type
(#2526) - Added Docker container support for APM Automatic Instrumentation (#2480)
Bug Fixes¶
- Fixed the issue where the OpenTelemetry collector could not handle micrometer data (#2495)
Optimizations¶
- Optimized disk metric collection and disk collection in host objects (#2523)
- Optimized Redis slow log collection, adding client information to the slow log. Meanwhile, slow log provides some support for low-version (<4.0) Redis (such as Codis) (#2525)
- Adjusted the error-retry mechanism of the KubernetesPrometheus collector during metric collection. When the target service is temporarily offline, it will no longer be removed from collection (#2530)
- Optimized the default configuration of the PostgreSQL collector (#2532)
- Added a configuration entry for trimming metric names for Prometheus metrics collected by KubernetesPrometheus (#2533)
- DDTrace/OpenTelemetry collectors now support actively extracting the
pod_namespace
tag (#2534) - Enhanced the log collection scan mechanism by mandating a 1-minute scan interval to prevent log file missing in extreme scenarios (#2536).