Changelog (2022)¶
This document records the update content description of each online release of Guance, including DataKit, Guance Best Practices, Guance Integration Documentation, and Guance.
December 29, 2022¶
Guance Updates¶
New MFA Authentication Management¶
Guance provides MFA authentication management to add an extra layer of security beyond the account username and password. After enabling MFA authentication, you will need to perform a secondary verification during login, helping to improve the security of your account. For more details, please refer to the document MFA Management.
New Workspace IP Access Login Whitelist¶
Guance supports configuring an IP whitelist for workspaces to restrict visiting users. After enabling the IP whitelist, only requests from IPs in the whitelist can log in normally, and requests from other sources will be denied access. For more details, please refer to the document IP Whitelist.
Note: The IP whitelist can only be set by administrators and owners, and the "Owner" is not restricted by the IP whitelist access limitations.
New Service Network Topology¶
In a K8S environment, you can use the Service network topology map to view the request relationships, response times, error rates, etc., between various Services in the K8S environment. When a connection problem is found in a Service, you can click to view the logs corresponding to that Service to locate the issue. For more details, please refer to the document Service Network.
Other Feature Optimizations¶
- Pipeline usage optimization
- Log clustering analysis supports custom fields
- Ultra-large logs support viewing associated logs after being split
- Backup log display optimization
- Data authorization sensitive fields support all types of data except Metrics
DataKit Updates¶
- Prometheus collector supports data collection via Unix Socket
-
Allows non-root users to run DataKit
-
Optimized httpflow protocol determination in eBPF
- Optimized Datakit installation and upgrade commands on Windows
- Optimized Pythond usage encapsulation
- Pipeline provides more detailed operation error information
- Pipeline reftable provides a SQLite-based localized storage implementation
For more DataKit updates, please refer to DataKit Version History.
Intelligent Inspection Updates¶
Official Intelligent Inspections Discontinued¶
The official intelligent inspections "Memory Leak", "Application Performance Detection", and "Disk Usage Rate" are discontinued from today. Users can configure them via the Self-built Inspection script library.
New Cloud Account Instance Dimension Bill Inspection¶
The cloud account instance dimension bill inspection helps users manage abnormal cost warnings at the cloud service instance level, predict cost situations, and provides users with prompts for high-growth, high-consumption instances and bill visualization capabilities, supporting multi-dimensional visualization of cloud service resource consumption.
New Alibaba Cloud Preemptible Instance Survival Inspection¶
Since the market price of preemptible instances fluctuates with supply and demand, it is necessary to specify the bidding mode when creating a preemptible instance. A preemptible instance can only be created successfully when the real-time market price of the specified instance specification is lower than the bid and inventory is sufficient. Therefore, inspecting cloud assets for preemptible instances is particularly important. Through inspection, when it is found that a preemptible instance is about to be released, it will prompt the latest price of all available zones for the current specification of the preemptible instance and the historical price of the preemptible instance and give appropriate handling suggestions.
For more intelligent inspection updates, please refer to Intelligent Inspection Changelog.
December 15, 2022¶
Guance Updates¶
New Profile, Network Billing Items¶
Guance adds Profile and Network billing items. Profile counts the number of application performance Profiles reported in the workspace on the current day. Network counts the number of hosts (host) with network data reported in the workspace on the current day. For more details, please refer to the document Billing Methods.
Guance Registration Process Optimization¶
Guance has simplified the registration process to help you complete registration quickly and start using Guance. For more details, please refer to the document Registering for Guance Commercial Plan.
New Warroom Feature¶
Guance provides a "Warroom" dashboard. You can get an overview of the key metric trends of all workspaces in the "Warroom" dashboard, helping you quickly understand the key data situation of each workspace and discover and resolve abnormal problems in a timely manner. For more details, please refer to the document Key Metrics Management.
New Dashboard Carousel Feature¶
Guance provides a carousel function for multiple associated business dashboards. After setting, you can display them on a large screen. For more details, please refer to the document Carousel Dashboards.
Workspace Binding External SLS Index¶
Guance adds support for binding SLS indexes, helping you to uniformly and quickly view and analyze your log data. At the same time, Guance adds support for field mapping, helping you map SLS index fields for cluster analysis. For more details, please refer to the document Log Index.
Trace Explorer Chart Display Optimization¶
Added "Span Count" and "Error Span Count" trend charts in the trace explorer. You can quickly view the trend of metrics such as Span count, Error Span count, request count, Error request count, and response time through the explorer's charts.
Other Feature Optimizations¶
- View variable interaction optimization
- Built-in view binding configuration interaction optimization
- Workspace overdue lock process optimization
- Workspace invitation user experience optimization
- Help documentation adds an evaluation system, including better and worse document ratings
- Monitor batch function optimization
- SLO adds enable/disable function
- Fuzzy matching Wildcard left * matching query limit optimization
DataKit Updates¶
- Added Golang Profiling access
- logfwd supports image field injection via
LOGFWD_TARGET_CONTAINER_IMAGE - trace collector:
- Optimized error-stack/error-message format issues
- SkyWalking compatibility adjustments, supporting the full 8.X series
- eBPF httpflow adds
pid/process_namefield, optimized kernel version support - datakit.yaml has been adjusted, it is recommended to update the yaml
- GPU graphics card collection supports remote mode
For more DataKit updates, please refer to DataKit Version History.
Intelligent Inspection Updates¶
Intelligent Inspection Adds Guide Page¶
Guance supports three intelligent inspection templates by default: Memory Leak, Disk Usage Rate, and Application Performance Detection. After this update, existing workspaces support manually deleting the default intelligent inspection templates, while prompting the system-wide offline time; newly created workspaces no longer provide these three intelligent inspections by default, adding a new user guide prompt.
Intelligent Inspection Components Support Configuring Custom Jump Links¶
Support configuring custom jump links when configuring self-built inspections. For how to configure self-built inspections, please refer to the document Self-built Inspection.
For more intelligent inspection updates, please refer to Intelligent Inspection Changelog.
Best Practices Updates¶
- Insight
- Scene (Scene) - SpringBoot Project External Tomcat Scene Link Observability.
For more best practices updates, please refer to Best Practices Version History.
December 1, 2022¶
Guance Updates¶
Application Performance New Service List Feature¶
The Application Performance Monitoring service list supports real-time viewing of ownership, dependencies, performance, associated dashboards, and associated analysis of different services, quickly discovering and resolving service performance issues, helping teams efficiently build and manage large-scale end-to-end distributed applications. For more details, please refer to the document Service List.
Dashboard Chart Usage Experience Optimization¶
- Added manual input for chart units, colors, and aliases. You can customize preset units, colors, and aliases according to the current specifications, such as the input format for units: aggregation function(metric), e.g.,
last(usage_idle) - Time series charts add Y-axis configuration, you can manually configure the maximum and minimum values of the Y-axis
- Time series charts and pie charts add group display. After enabling, only tag values are displayed in the legend
- Honeycomb charts, China maps, and world maps add gradient intervals, including automatic and custom
- Bar chart and histogram display optimization
For more details, please refer to the document Visualization Charts.
View Variable Usage Experience Optimization¶
- View variables now support multiple selections
- When configuring view variables, added the option to set multiple default values
For more details, please refer to the document View Variables.
User Access Monitoring Session Viewer Adjustment¶
In the User Access Monitoring Session viewer, added "Session" and "All Records" view lists:
- Session: Statistics data deduplicated by Session within the current time range, displaying the latest Session data by default;
- All Records: Display all reported data for Sessions, where the same Session may correspond to multiple data records.
Note: In the Session list, adding display columns is supported. If the display columns added by the user are fields that do not exist in the current Session data, there may be empty data in the session list. For more details, please refer to the document Session.
Events Add Mobile App Jump Option¶
After users receive an alert event notification, clicking "Go to Guance to view" can go to the Guance mobile APP to view the details of the abnormal event.
Other Feature Optimizations¶
- New user guide page optimization, supports minimization
- User Access Monitoring application list optimization, added time widget custom interval switching query
- Host Label addition interaction optimization
DataKit Updates¶
- Added Python Profiling access
- Pythond adds custom event reporting function
-
netstat supports metric collection for specific ports
-
Optimized Redis collector CPU usage collection, added new metric fields
- Optimized logfwd collector configuration
- Completed host object field collection, added network, disk and other related fields
For more DataKit updates, please refer to DataKit Version History.
Intelligent Inspection Updates¶
- Kubernetes Pod Abnormal Restart Inspection
- MySQL Performance Inspection
- Server Application Error Inspection
- Memory Leak Inspection
- Disk Usage Rate Inspection
- Application Performance Inspection
- Frontend Application Log Error Inspection
For more intelligent inspection updates, please refer to Intelligent Inspection Changelog.
Best Practices Updates¶
- Monitoring
- Application Performance Monitoring (APM) - Performance Optimization - Using async-profiler for Application Performance Tuning
For more best practices updates, please refer to Best Practices Version History.
November 17, 2022¶
Guance Updates¶
Pipeline Usage Experience Optimization¶
- Pipeline supports configuring multiple selections for filter conditions
- Supports setting any Pipeline script as the "default Pipeline script". If the current data type does not match other Pipeline scripts during Pipeline processing, the data will be processed according to the rules of the default Pipeline script.
- Script function categorization
Event Optimization¶
- Supports writing user custom events. For more details, please refer to the document Writing Event Data via openAPI.
- Unrecovered event viewer left side adds quick filter
Custom Explorer Supports Selecting More Types of Data¶
When creating an explorer in a scene, supports selecting 5 data types: Log, Application Performance, User Access, Security Inspection, Profile. Cannot be changed after saving.
Member Management Adds Member Grouping Function¶
Added member grouping function in member management. You can edit member groups. A member can belong to multiple groups; meanwhile, when configuring notification targets for alert policies, support configuring member group labels.
Optimized Built-in View Binding Function¶
In the scene "Built-in Views" - "User Views", the binding field adds support for fuzzy matching. After the field is bound, the associated built-in view can be viewed in the explorer.
New One-click Import/Export of Dashboards, Custom Explorers, Monitors within the Workspace¶
In "Management" - "Basic Settings", added one-click import/export of JSON configurations for dashboards, custom explorers, monitors, etc., within the workspace. You can import dashboards, custom explorers, monitors, etc., from other workspaces into the current workspace.
Application Performance Monitoring Log Page Adds Custom Association Fields¶
On the Application Performance Monitoring details page, on the associated log page, if you are an administrator or above, you can customize association fields. You can click the settings button on the right side of the association field, select the fields that need to be associated in the pop-up dialog box, support manual input, support drag-and-drop order, and confirm to complete the configuration.
Other Feature Optimizations¶
- When using fuzzy matching and fuzzy non-matching in filter conditions, supports left * matching
- Infrastructure analysis dimension optimization, last reported time of host on host details page
- Supports users quickly logging into the last single sign-on
- Blacklisted application performance monitoring adds support for filtering "All Services"
- Users registered at the "China Region 4 (Guangzhou)" site optimized process for upgrading to the commercial version
- Field descriptions support viewing in quick filters, display columns, etc.
DataKit Updates¶
- Added SNMP Collector
- Added IPMI Collector
- Added batch injection DDTrace-Java tool
- The latest DDTrace-Java SDK enhances SQL desensitization function
- Remote Pipeline optimization
- Pipeline supports source mapping relationship configuration, facilitating batch configuration between Pipeline and data sources
- Pipeline provides function classification information, facilitating remote Pipeline writing
- Optimized Kafka Message Subscription, no longer limited to obtaining SkyWalking related data, also supports rate limiting, multi-version coverage, sampling, and load balancing settings
- By providing additional configuration parameters (
ENV_INPUT_CONTAINER_LOGGING_SEARCH_INTERVAL), alleviates short lifecycle Pod log collection issues - In a pure container environment, supports configuring container log collection via label method
- Added Pipeline functions
- sample(): Sampling function
- b64enc(): Base64 encoding function
- b64dec(): Base64 decoding function
- append(): List append function
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
-
Insight
- Guance Tips (Skills) - DataKit Configuration HTTPS
-
Monitoring
- Application Performance Monitoring (APM) - ddtrace common parameter usage
- Application Performance Monitoring (APM) - ddtrace-api usage guide
- Application Performance Monitoring (APM) - ddtrace sampling
- Application Performance Monitoring (APM) - ddtrace log correlation
For more best practices updates, please refer to Best Practices Version History.
November 11, 2022¶
Guance Adds "China Region 4 (Guangzhou)" Site¶
Guance supports multi-site registration and login, adding the "China Region 4 (Guangzhou)" site, combined with the existing "China Region 1 (Hangzhou)", "China Region 2 (Ningxia)", "Overseas Region 1 (Oregon)", comprehensively covering different regions domestically and internationally, providing users with comprehensive data observability.
Note that Guance accounts and data from different sites are independent and cannot be shared or migrated between each other. You can choose the appropriate site for registration and login based on the resources you use. Currently, Guance supports the following four sites.
| Site | Login URL | Operator | Settlement Method |
|---|---|---|---|
| China Region 1 (Hangzhou) | https://auth.guance.com/ | Alibaba Cloud | Guance Enterprise Account, Cloud Account |
| China Region 2 (Ningxia) | https://aws-auth.guance.com/ | AWS | Guance Enterprise Account, Cloud Account |
| Overseas Region 1 (Oregon) | https://us1-auth.guance.com/ | AWS | Guance Enterprise Account, Cloud Account |
| China Region 4 (Guangzhou) | https://cn4-auth.guance.com/ | Huawei Cloud | Guance Enterprise Account, Huawei Cloud Stored Value Card |
November 3, 2022¶
Guance Updates¶
New Guance, SLS Joint Solution¶
Guance adds an SLS storage solution, supporting Alibaba Cloud SLS users to quickly use Guance for data viewing and analysis. When registering/upgrading to the commercial version of Guance, after selecting "Alibaba Cloud Account Settlement", you can choose the SLS storage solution to store data under your own Alibaba Cloud account, achieving data exclusivity. For more details, please refer to the document SLS Storage.
The joint solution of Guance and SLS supports you to synchronize log data from your Alibaba Cloud account to the Guance workspace for query and analysis through the bind log index function. You can click "Bind Index" under "Logs" - "Index" in the Guance workspace to synchronize log data from the index. For more details, please refer to the document Log Index.
Optimized New User Guide Page¶
Guance adds new user guide pages for scenes, events, infrastructure, metrics, logs, application performance monitoring, user access monitoring, availability monitoring, security inspection, CI visualization, monitoring and other major functional modules. At the same time, the workspace welcome homepage is optimized, allowing you to more easily understand and use Guance through the new user guide.
Added 3 Intelligent Inspection Configuration Documents¶
- Cloud Account Bill Inspection: Helps users manage cloud service budget warnings, abnormal cost warnings, predict cost situations, and provides users with visualization capabilities, supporting multi-dimensional visualization of cloud service resource consumption;
- Frontend Application Log Error Inspection: Helps discover new error messages (clustered Error Message) that appeared in the frontend application in the past hour, helping development and operations repair code in time to avoid continuous damage to customer experience over time;
- Alibaba Cloud Asset Inspection: Helps users gain more understanding of the performance status of cloud vendor products.
New Trace Error Tracking Explorer¶
In Application Performance Monitoring, added an error tracking explorer to help you quickly view similar errors in traces and their distribution, quickly locating performance issues. For more details, please refer to the document Trace Tracking.
Optimized Time Series Chart, Overview Chart Period-over-Period Comparison Function¶
The period-over-period comparison function no longer interacts with the locked time. The button is displayed by default and is in the off state. After enabling period-over-period comparison, the comparison dimension supports 4 options: Hour (compared with one hour ago), Day (compared with one day ago), Week (compared with one week ago), Month (compared with one month ago). For more details, please refer to the document Period-over-Period Comparison.
Other Feature Optimizations¶
- Dashboards/Notes/Explorers add a "Save Snapshot" button in "Settings";
-
Time widget adds more options;
-
Pipeline and blacklist functions add import, batch export, batch delete functions;
- Intelligent inspection adds dashboard, bar chart chart components;
- Func page under the Integration menu optimization.
DataKit Updates¶
- Improved Prometheus ecosystem compatibility, added ServiceMonitor and PodMonitor collection recognition
-
Added Java Profiling access based on async-profiler
-
eBPF collection adds interval parameter to facilitate adjusting the amount of data collected
- All remote collectors default to using their collection address as the value of the
hostfield to avoid possible misunderstandings of thehostfield value during remote collection - APM data collected by DDTrace can automatically extract error-related fields, facilitating the center to do better APM error tracking
- MySQL collector adds additional fields
Com_commit/Com_rollbackcollection
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
-
Monitoring
- Application Performance Monitoring (APM) - Kafka Observability Best Practices
-
Cloud Platform Access
- Alibaba Cloud - Alibaba Cloud ACK Access to Guance
For more best practices updates, please refer to Best Practices Version History.
October 20, 2022¶
Guance Updates¶
Monitoring Optimization¶
New Outlier Detection¶
Outlier detection is an algorithm that detects whether the metrics/statistical data of detection objects under specific groups have outlier deviations. If there is an inconsistency exceeding a certain degree, an outlier detection abnormal event will be generated for subsequent alert tracking. For more details, please refer to the document Outlier Detection.
Mutation Detection Logic Optimization¶
The new version of mutation detection determines whether an abnormal situation occurs by comparing the absolute or relative (%) change value of the same metric in two different time periods. It is mostly used to track peaks or data changes of a certain metric, and can more accurately generate events for recording when abnormal situations occur. For more details, please refer to the document Mutation Detection.
Range Detection Logic Optimization¶
The new version of range detection performs anomaly detection on metric data within the selected detection range time range. When the proportion of mutated abnormal data points exceeds the set percentage, a range detection abnormal event is generated. It is mostly used to monitor data/metrics with stable trends. For more details, please refer to the document Range Detection.
Other Optimizations¶
- Adjusted read-only member permissions, supporting viewing the configuration content of monitors, intelligent inspections, SLOs, mute management, and alert policies
- Data gap configuration selection "Trigger data gap event" and "Trigger recovery event" configuration adjusted to time range configuration, supporting manual input
- Supports configuring monitor detection based on "Event" data
- Alert configuration adds "Info" event notification level selection
Scene Chart Optimization¶
New Histogram Chart Component¶
A histogram, also known as a mass distribution diagram, is used to represent the distribution of data. It is a common statistical chart. Generally, the horizontal axis represents the data interval, and the vertical axis represents the distribution. For more details, please refer to the document Histogram.
Chart Function Optimization¶
- Overview charts, treemaps, funnel charts add time slicing function
- Time series chart "View similar trend metrics" changed from only supporting metric queries to supporting all data types, including logs, application performance, user access, etc.
- Top lists support viewing all content beyond the chart width
- Honeycomb chart display optimization
Explorer Optimization¶
Explorer Supports Analysis Mode¶
Added "Analysis" function in Log, Application Performance Monitoring, User Access Monitoring, Availability Detection, Security Inspection, CI explorers. Infrastructure adjusted "Grouping" to "Analysis", supporting multi-dimensional analysis statistics based on 1-3 tags to reflect the distribution characteristics and trends of data in different dimensions over time. In analysis mode, supports multiple data chart analyses, including time series charts, top lists, pie charts, and treemaps. For more details, please refer to the document Explorer Analysis Mode.
Optimized Associated Log Viewing Experience¶
In the details pages of Infrastructure, Application Performance Monitoring, User Access Monitoring, CI Visualization explorers, optimized the associated log viewing experience. By default, log content is displayed according to the "Maximum Display Rows" and "Display Columns" set in the log explorer, and supports custom adjustment through "Display Columns".
Log Index Optimization¶
After setting the index in "Logs" - "Index", log-related queries add an index option, defaulting to the index default. You can query and analyze log data according to the index you set. The functions involved include the log explorer, scene custom explorer, chart log query, log monitor, etc. For more details, please refer to the document Log Index.
Optimized Metric Analysis Chart Query¶
Added time interval and legend options in metric analysis, adjusted the display order of legend aggregation functions. For more details, please refer to Metric Analysis.
Other Feature Optimizations¶
- In the User Access Monitoring application list, click to enter the application, a drop-down menu option is added in the upper left corner to help users quickly switch and view different application data
- Added Pipelines quick access in the Metrics, User Access Monitoring, Application Performance Monitoring, Infrastructure, Security Inspection directories
DataKit Updates¶
- DataKit collector configuration and Pipeline support synchronization via etcd/Consul and other configuration centers
- Prometheus Remote Write optimization
- Collection supports filtering tags via regular expressions
- Supports filtering measurement names via regular expressions
- Pipeline optimization
- Optimized functions like grok() so that they can be used in
if/elsestatements to determine whether the operation took effect - Added match() function
- Added cidr() function
- Optimized functions like grok() so that they can be used in
- Process collector adds open file list detail fields
- Improved disk caching and queue processing for externally accessed data (T/R/L)
- Monitor adds usage overage prompt: at the bottom of the monitor, if the current space usage is over the limit, there will be a red text prompt
Beyond Usage - Optimized log collection position function. In a container environment, the file will be mounted to the host to avoid losing the original position record after DataKit restarts
- Optimized collection delay issues in sparse log scenarios
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
-
Monitoring
- Infrastructure Monitoring (ITIM) - Ansible Batch Processing Practice.
-
Cloud Native
- Logs - Guance Collects Amazon ECS Logs.
For more best practices updates, please refer to Best Practices Version History.
September 29, 2022¶
Guance Updates¶
New Field Management¶
Guance supports unified management of field data in the current workspace, including system fields and custom fields. You can view field descriptions in scene chart queries, monitor detection metrics, DQL query simple query mode, etc., helping you quickly understand field meanings and apply fields. The figure below shows viewing the field description of host in a scene chart query. For more details, please refer to the document Field Management.
Optimized Metrics Function¶
Metric Explorer Transformation¶
The original "Metric Explorer" is changed to "Metric Analysis", supporting users to visually query data in time series chart mode (line chart, area chart, bar chart). For more details, please refer to Metric Analysis.
Optimized Metric Management¶
-
Supports searching for "Measurement" and "Metric" in "Metric Management";
-
Supports modifying the "Unit" and "Description" of metrics in "Metric Management";
- Supports viewing the description information of tags in "Metric Management".
For more details, please refer to Metric Management.
New Export Event Content as PDF Format¶
On the event details page, the Jump to Monitor button adds text description. Click the button, you can choose Export JSON File and Export PDF File to obtain all key data corresponding to the current event.
Monitor Adjustment¶
- Monitor name does not support customization. The event title is generated synchronously after input;
- Monitor supports "info" event generation logic. The trigger condition can choose to enable or disable information: normal detection results also generate events;
- The monitor configuration step order is adjusted to: Detection Configuration - Event Notification - Association.
Note: In the latest version, the "Monitor Name" will be generated synchronously after the "Event Title" is entered. Old monitors may have inconsistencies between "Monitor Name" and "Event Title". To provide a better user experience, please synchronize to the latest version as soon as possible, supporting one-click replacement of the event title.
Mute Rules Support Dynamic Configuration¶
In Mute Management, optimized the mute scope, supporting options such as monitors, intelligent inspections, self-built inspections, SLOs, alert policies, etc. Added tag options.
Note: Mute scope is required, tags are optional. Mute only takes effect when both the "Mute Scope" and "Tag" conditions are met. For example, if the mute scope selects Application Performance Detection and Disk Usage Rate, and the tag selects: host:izbp152ke14timzud0du15z, when the conditions simultaneously meet Application Performance Detection and host:izbp152ke14timzud0du15z or the conditions simultaneously meet Disk Usage Rate and host:izbp152ke14timzud0du15z, the alert notification is muted and will not be sent to the alert notification targets.
Optimized Pipeline Configuration Page¶
In Text Processing (Pipeline), supports adding multiple sample parsing tests, fixed the issue of one-click sample acquisition error for metric data. For more details, please refer to Text Processing (Pipeline).
Other Feature Optimizations¶
- Notes add global lock time configuration. After configuring the global lock time, all charts on the note page display data according to this lock time
- Unrecovered event query changed to data from the last 48 hours, supports manual refresh
- User Access Monitoring supports same-name user view overwrite logic
- User Access Monitoring Web, WeChat Mini Program, and Mini Program developed based on the uniapp framework initialization parameters add
isIntakeUrlconfiguration, used to determine whether to collect corresponding resource data based on the requested resource URL, default is to collect all - Intelligent inspection adds support for self-built inspections
- Generate metric page operation column adjustment, added "Open in Metric Analysis" and "Open in Metric Management" operation icons
- Management navigation menu position adjustment, SSO management moved to Member Management, Notification Target management moved to Monitoring, Built-in Views moved to Scenes
Best Practices Updates¶
- Cloud Native: Multiple Kubernetes Clusters Metric Collection Best Practices.
For more best practices updates, please refer to Best Practices Version History.
September 15, 2022¶
Guance Updates¶
New Log Explorer DQL Search Mode¶
In the log explorer, you can click the toggle icon
in the search bar to switch to the DQL manual input query mode, supporting custom input of filter conditions and search conditions.
- Filter conditions: Support any combination of
and / or, support using()parentheses to indicate search priority, support operators such as=,!=, etc.; - Search conditions: Support searching logs through the DQL function
query_string().
For more details, please refer to the document DQL Search.
Optimized Application Performance Monitoring¶
Trace Explorer Details Page Optimization¶
- Optimized details page layout display, added duration, http method, http url, http status code, TraceId, etc.
- Flame graph adds thumbnail, double-click Span to zoom in, click service name to highlight the corresponding Span
- Span list adds search, click to select Span and switch to the flame graph for synchronized display
- Service call relationship adds search
For more details, please refer to the document Trace Explorer.
Profile Explorer Details Page Optimization¶
Added quick operations in the Profile explorer details page, including search, copy, and click to select and view methods. For more details, please refer to the document Quick Operation Instructions.
Optimized Monitor Event Notification Content Editing Mode¶
When creating a monitor, the event notification content sent when the trigger conditions are met supports input and preview of markdown format text information, and supports using template variables.
New Mute Management Supports Configuring Periodic Muting¶
Mute management adds periodic mute function. In mute time, you can choose "Only Once" or "Repeat". For more details, please refer to Mute Management.
Other Feature Optimizations¶
- Scene dashboard group display optimization, scene notes optimize text component Markdown format
- Infrastructure explorer display optimization, added display column CPU usage rate, MEM usage rate and other prompt information; "Pods", "Deployments", "Services" explorer details page add view and copy corresponding YAML file functions
-
Metric explorer removes list view mode, retains tiled view mode and mixed view mode
-
Log multi-index support jump view
- Explorer quick filter value TOP 5 supports viewing proportion count, explorer time field format optimization, default display format is
2022/09/15 20:53:40 - Trace explorer time series chart adds legend display, quick filter adds HTTP related fields
DataKit Updates¶
- Added automatic cloud synchronization function, no longer need to manually specify cloud vendors
- Supports using k8s labels as tags synchronized to pod metrics and logs
- Supports collecting various yaml information in k8s to the corresponding object data
- Trace collection supports automatic extraction of some key meta information
- Supports specifying the installation source address during installation to simplify offline installation process
- Pipeline new features:
- Added for loop/dictionary/array support
- Added arithmetic expression support
- Pipeline error information will be displayed on the collected data
- If the time field切割 fails, supports automatic correction of the time field (
time) to avoid time display issues on the console page - Added len() function
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
- Monitoring - Middleware (Middleware) - Insight MySQL.
For more best practices updates, please refer to Best Practices Version History.
September 1, 2022¶
Guance Updates¶
Billing Update¶
Guance billing item backup log adjusts billing rules. After adjustment, backup log is charged at 0.007 yuan per 1 GB of data/day. You can adjust the corresponding data retention policy according to actual needs to save cost. For more details, please refer to the document Billing Methods.
Help Documentation Directory Update¶
- Split the changelog directory, supporting viewing Guance update content by year;
- Added custom development directory, where you can view documents for custom collectors (Pythond), custom security inspections (Scheck), custom DDtrace, etc.;
- Added private deployment directory, where you can view documents related to Guance deployment.
Time Series Chart Adds Event Correlation Analysis¶
The event correlation analysis added to the time series chart supports matching abnormal events related to the selected field through "Add Filter Field", thereby achieving the purpose of displaying time series data and events associated, helping users perceive whether related events occurred during data fluctuations while viewing trends, to achieve problem localization.
- After configuring event correlation in the time series chart, on the time axis, if there are event records related to the selected association field (e.g., "host:DESKTOP-NEN89U3"), they will be marked with a shaded color block;
- After configuring event correlation in the time series chart, in analysis mode, click the highlighted shaded color block to view abnormal events related to the selected field (e.g., "host:DESKTOP-NEN89U3").
For more details, please refer to the document Time Series Chart.
Logs Add Multi-Index Mode¶
Guance supports setting multiple log indexes, screening log data that meets the conditions and saving them in different log indexes, and by selecting different data storage policies for log indexes, helping users save log data storage costs. For more details, please refer to the document Log Index.
Optimized Backup Log Rules¶
The new rule for backup logs is moved under the Log Index menu. The backup cycle is adjusted from backing up the previous day's logs at 0:00 every day to performing rule verification and backup every 5 minutes, that is, configured backup rules can view backed up log data at most 5 minutes later. For more details, please refer to the document Backup Logs.
Optimized Log Context¶
When viewing the context on the log details page and opening a new log explorer, it will use "host", "source", "service", "filename" as filter conditions to view data before and after the current log time.
Optimized User Access Monitoring¶
New Custom User Access Monitoring Application ID¶
User Access Monitoring supports users to customize the application ID as the unique identifier of the current workspace (the original Guance application ID is globally unique, meaning application IDs in different spaces cannot be repeated). Different workspaces can use the same application ID as the unique identifier for the same application, used for SDK collected data upload matching. For more details, please see the documents:
-
Web Application Access
-
Android Application Access
-
iOS Application Access
-
Mini Program Application Access
New User Access Monitoring Network Request Error Correlation Trace View¶
User Access Monitoring supports users to associatively view the relevant traces of network (network) errors in the RUM explorer.
Intelligent Inspection Comprehensive Upgrade¶
- Intelligent inspection list adjusts "Group" to "Alert Policy", supports selecting alert policies for intelligent inspections during editing;
- Intelligent inspection related event details page comprehensive adjustment, including related pages such as basic attributes, event details, root cause analysis, alert notification, history records, associated events, etc.; simultaneously optimizes abnormal chart analysis, adds data association jump links, abnormal interval prompts, root cause analysis interval prompts, legend display operations, etc.
For more details, please refer to Intelligent Inspection.
Optimized Monitoring¶
Adjusted Group to Alert Policy¶
In "Monitoring", the "Group Management" page is adjusted to "Alert Policy Management", by creating new alert policies to filter monitors and set alert notifications.
In Monitors, the original "Group" filter is adjusted to "Alert Policy" filter, supports selecting alert policies when configuring monitors, supports clicking the monitor's alert policy to modify alert settings.
Optimized Monitor Configuration¶
-
Adjusted the monitor configuration process steps, including detection frequency, detection interval, alert policy, etc.;
-
Optimized detection metrics, supports configuring outer functions (fx), including "Label Filtering", "Conversion Functions", "Data Gap Filling"; supports switching to DQL query mode;
-
Optimized trigger condition text prompts;
-
Optimized event notification, removed some monitor custom data gap event titles and content; adjusted template variable prompt text; adjusted group to alert policy, and supports editing alert policies;
-
Optimized log detection, application performance metric detection, user access metric detection "Detection Metric" configuration.
For more details, please refer to Monitor Management.
New "Infrastructure Survival Detection"¶
"Infrastructure Survival Detection" is used to monitor the running status of infrastructure. For more details, please refer to Infrastructure Survival Detection.
New "Process Anomaly Detection"¶
"Process Anomaly Detection" is used to monitor process data within the workspace, supports setting trigger alerts for one or more field types of process data. For more details, please refer to Process Anomaly Detection.
Note: Infrastructure object detection has been removed. Existing related monitors can continue to be used, but new creation is no longer supported.
Optimized "Application Performance Metric Detection"¶
"Application Performance Metric Detection" adds "Trace Statistics" detection method, counting the number of traces that meet the conditions within a specified time, triggering abnormal events when exceeding the custom threshold. Can be used for service trace abnormal error notification. For more details, please refer to Application Performance Metric Detection
Optimized Member Management¶
- Read-only members no longer support viewing the member management page;
- Supports filtering ordinary members and SSO members, SSO members only display members corresponding to the currently configured identity provider (IDP).
For more details, please refer to Member Management.
New DEMO Workspace¶
Guance provides a workspace Demo to help you better use the various functional modules of Guance. You can click "Workspace" in the upper left corner, and then click "Experience Demo Space" in the pop-up dialog box to enter the Demo space for viewing.
Other Feature Optimizations¶
- The metric aggregation function in charts changes from the default last to avg, the log data aggregation function changes from the default last to count
- Optimized time series chart, pie chart legend copy experience
- Optimized interactive display in note editing mode
- Snapshots support saving the current explorer's display column information
- The time field on the trace details page is formatted for display, converting the timestamp into date format for display
- Deployment version management backend supports modifying the workspace's data retention policy
DataKit Updates¶
Breaking changes¶
- In Gitlab and Jenkins collectors, the time fields related to CI/CD data have been adjusted to unify the data display effect on the front-end page
Collector Function Adjustment¶
- Optimized IO module data processing, improving data throughput efficiency
- Added disk cache function to various types of Trace data
- DataKit's own measurement set adds measurement set related to goroutine usage (
datakit_goroutine) - MySQL collector adds
mysql_dbm_activitymeasurement set - Added netstat collector
- TDengine adds log collection
- Optimized fstype filtering in the disk collector, only collecting common file systems by default
- In the log collector, for each log, add the field
message_lengthto represent the current log length, facilitating filtering logs by length - CRD supports locating Pod range through DaemonSet
- eBPF removes go-bindata dependency
- The container collector by default will open k8s and container related metrics, which will consume additional time series to a certain extent
Bug Fixes¶
- Fixed DataKit's own CPU usage calculation error
- Fixed SkyWalking middleware identification issue
- Fixed Oracle exit issue
- Fixed Sink DataWay failure issue
- Fixed HTTP /v1/write/:category interface JSON writing issue
Document Adjustment¶
- Almost every chapter has added jump labels for permanent reference by other documents
- pythond documents have been moved to the custom development directory
- Collector documents have been migrated from the original "Integration" to the "DataKit" documentation library
- DataKit document directory structure adjustment, reducing directory levels
- Almost every collector has added k8s configuration entry
- Adjusted document header display, in addition to the operating system identifier, for collectors that support election, an election identifier is added
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
-
Cloud Native
- Using CRD to Start Your Ingress Observability Journey
-
Monitoring
- Application Performance Monitoring (APM) - DDtrace Custom Instrumentation
- Application Performance Monitoring (APM) - DDtrace Guance Secondary Development Practice
For more best practices updates, please refer to Best Practices Version History.
August 18, 2022¶
Guance Updates¶
Guance Billing Update¶
Guance billing items application performance, user access, logs add data retention policies and corresponding unit prices. Log data adds a 7-day data retention policy, application performance and user access add a 3-day data retention policy. You can adjust the corresponding data retention policy according to actual needs to save cost. For more details, please refer to the document Billing Methods.
Optimized Explorer¶
New Filter History¶
Guance adds an explorer filter history function, supporting the current user to save the explorer's key:value search condition history in the local browser. In the current workspace, you can directly use historical search conditions in different explorers, helping to avoid repeated configuration of filter conditions.
- Open filter history: Support by clicking the expand icon in the lower right corner of the explorer, or directly using the shortcut key (Mac OS: shift+cmd+k / Windows: shift+ctrl+k) to quickly open the filter history;
- Close filter history: Click the close button
xor use theesckey to close the filter history.
For more details, please refer to the document Filter History Instructions.
New Quick Filter Value Sorting¶
In the explorer quick filter, added support for sorting field values by percentage. Click the "Settings" button in the upper right corner of the quick filter item, select "Query Value TOP 5", to view the statistical count percentage of the top five field attribute values for the current filter item. On the right side of the top list, support clicking "Positive Filter", "Negative Filter" buttons to perform data filter query on the field attribute values of the current ranking in the form of key:value. For more explorer quick filter details, please refer to the document Quick Filter Instructions.
New Time Widget Input Format Hint Page¶
In the explorer time widget, added a time widget input format hint page, optimized the text input box of the time widget, supports real-time linkage with the hint box, helping users quickly understand the format types supported by the time widget input. For more explorer time widget details, please refer to the document Time Widget Instructions.
New Display Column Field Divider and Text Prompt¶
When adding display columns in the explorer, added field dividers and their add text prompts. When entering fields in the display column, preset fields are distinguished from existing fields by dividers, and the text "Create and Add" is prompted, support selecting to add display columns by using the keyboard up and down toggle keys (↑ ↓); for more explorer display column details, please refer to the document Display Column Instructions.
New Scene Dashboard/Note/Explorer View Permissions.¶
When creating a new Dashboard, Note, Explorer in a scene, supports the creator customizing view permissions to prevent the creator's own dashboard from being mistakenly moved, or to publish dashboards / notes / explorers that are not fully built or not convenient to publish in advance. Permission settings include "Public" and "Only Visible to Me".
- Public: Open to all members in the current workspace, other members' viewing and editing permissions are not affected;
- Only Visible to Me: Only the creator is visible, other members do not have viewing permissions.
Note: When accessing non-public dashboards/notes/explorers through links, etc., non-creators cannot see them, and the user is prompted that they have no permission to view.
New Snapshot View Permissions¶
When saving a snapshot in a scene or explorer, supports customizing view permissions. Permission settings include "Public" and "Only Visible to Me".
- Public: Indicates that users of the current workspace can view the saved snapshot;
- Only Visible to Me: Indicates that except for the current user, other users cannot view the saved snapshot
After saving the snapshot in the explorer, supports viewing the time range and filter conditions of the historical snapshot. The time range is divided into "Absolute Time", "Relative Time" and "Default" based on the selection when saving the snapshot. For more snapshot usage details, please refer to the document Snapshot.
Optimized Monitors and Events¶
New Monitor Test¶
Monitors add manual trigger monitor detection. If the current detection rule is triggered, you can view related details in the event explorer.
Optimized Intelligent Inspection Information Display¶
Intelligent inspection list adds "Data Range", "Detection Frequency", "Group", "Last Trigger Time", "Trigger Status". Operations add "Export JSON Configuration", export file name format: intelligent inspection name.json.
Intelligent inspection event details page adds general pages such as "Basic Attributes", "History Records", "Associated Events", etc.
Optimized Event Details Page¶
On the event details page, optimized the "Basic Attributes" page; "Copy Current Event Content" is adjusted to "Export Event JSON" button, and a "View Monitor Configuration" button is added on the left, supporting one-click jump to the monitor configuration page of the current event, facilitating real-time adjustment of monitor rules.
If the monitor is configured with alerts, the associated event details page adds an "Alert Notification" page. If the monitor is set to mute alerts, during the mute period, the same alert event will not be sent to the notification targets. On the "Alert Notification" page of the event details page, this alert notification is marked as "Notification Not Sent".
Other Feature Optimizations¶
- Guance adds support for email verification method authentication
- Explorer details page associated network page optimizes host, Pod, Deployment type display;
- When adding charts in dashboards and built-in views, etc., add toggle icons, adjust view variable edit button position;
- Time series chart legend value adds
sumsummation statistics, simultaneously optimizes legend display and interaction; - Monitors, chart query log data filter conditions add wildcard and not wildcard.
DataKit Updates¶
New Features¶
- Pipeline adds reftable function
- DataKit 9529 HTTP supports binding to domain socket
- Corresponding eBPF collection and Oracle collection, their configuration methods also need to be changed accordingly.
- RUM sourcemap adds Android R8 support
- CRD adds log configuration support
- Complete example
Optimizations¶
- Optimized container collector documentation
- Added Common Tags documentation
- Optimized election configuration and some related naming
- Election collectors, when DataKit election is enabled, still support turning off the election function on specific collectors
- Supports specifying data type io block configuration
- DDTrace collector sampling adds meta information identification
- DataKit's own measurement set adds 9529 HTTP request related metrics
- Optimized Zipkin collection memory usage
- DDTrace collector, after enabling disk cache, defaults to blocking IO feed
- eBPF adds process name (process_name) field
- DCA new version released
- Log class HTTP data writing (logstreaming/Jaeger/OpenTelemetry/Zipkin) all add queue support
- Log collection adds automatic multi-line support
Bug Fixes¶
- Fixed MySQL collector connection leak issue
- Fixed Pipeline Json value extraction issue
- Fixed ulimit setting invalid issue on macOS
- Fixed sinker-Dataway invalid issue in Kubernetes
- Fixed HTTP data writing class interface data verification issue
- Fixed eBPF collector structure offset calculation failure issue due to kernel changes
- Fixed DDTrace close-resource issue
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
-
Monitoring
- Using extract + TextMapAdapter to implement custom traceId
-
Insight
- Scene - Intranet Scene Dubbo Microservice Access to Guance
For more best practices updates, please refer to Best Practices Version History.
August 4, 2022¶
Guance Updates¶
DCA Web Version Online¶
Guance adds a DataKit Web management tool DCA, designed to facilitate the management of installed and configured collectors, supporting viewing collector running status, collector configuration management, Pipeline management, blacklist management, and collector documentation help and other functions. For more details, please refer to the document DCA.
Optimized Explorer Search, Quick Filter, Time Widget, Display Columns¶
Explorer Search¶
- In the explorer search bar, added "Exist", "Not exist" filter logic, returning search results by judging whether the filter field exists;
- In the explorer search bar, added operation methods for numeric types, supporting operators including
equal to, greater than or equal to, less than or equal to, greater than, less than, not equal to, etc.; - Added field data type prompts, matching pattern description
For more explorer search details, please refer to the document Search Instructions.
Quick Filter¶
- In the quick filter items, added tag field search function. When there are more than 10 tag fields in the quick filter items, support fuzzy search by "Field Name" or "Display Name";
- In the quick filter items, added a "Settings" button, supporting adding / removing display columns;
- In the quick filter items, added a "Settings" button, personal level filter items added custom support editing display names and deleting fields.
For more explorer quick filter details, please refer to the document Quick Filter Instructions.
Time Widget¶
- Added interval display, supports clicking the display box of the time widget to manually input the time range, including "Interval Range", "Timestamp Range" and "Standard Time Format";
- The drop-down list adds custom options, the custom range supports
hour:minute:secondformat input; - Added viewing of the last 20 historical absolute time records.
For more explorer time widget details, please refer to the document Time Widget Instructions.
Display Columns¶
In the explorer list, when the mouse is placed on the display column, a "Settings" button is added, supporting operations on the display column such as ascending order, descending order, move column left, move column right, add column left, add column right, replace column, add to quick filter, add to group, remove column, etc. For more explorer display column details, please refer to the document Display Column Instructions.
Optimized Explorer Details Page¶
- Adjusted the position of operations such as "Filter Field Value", "Reverse Filter Field Value", "Add to Display Column" and "Copy" for extended field quick filtering;
- Supports switching to view the previous and next log details through the keyboard up and down toggle keys (↑ ↓);
- Supports directly closing the details page pop-up window through the keyboard esc button.
- Added "View Context" function: When viewing the context on the log details page and opening a new log explorer, it will use "host", "source", "service", "filename" as filter conditions, and use the current log time as the start time, current log time + 30 minutes as the end time to bring to the time widget for context log filtering. For more details, please refer to the document View Context.
New Global Explorer Auto Refresh Configuration¶
In the Guance workspace, click "Account", you can enable / disable "Explorer Auto Refresh".
- Enable: The explorer's data is automatically refreshed according to the default data refresh time of the time widget, 30 seconds, such as selecting the last 15 minutes, refresh every 30 seconds to display the data of the last 15 minutes
- Disable: The time widget of the explorer turns off the 30-second auto-refresh when entering, such as selecting the last 15 minutes, that is, display the content data of that 15-minute absolute time and no longer automatically refresh, you can click the "Play" button to refresh and view the last 15 minutes of data.
Note: Explorer auto refresh only takes effect on the local browser. For more explorer time widget details, please refer to the document Time Widget Instructions.
New Global Blacklist Function¶
Guance supports filtering out different types of data that meet the conditions by setting a blacklist, that is, after configuring the blacklist, data that meets the conditions will no longer be reported to the Guance workspace, helping you save data storage costs. The global blacklist function currently supports filtering data types including logs, basic objects, resource directories, network, application performance monitoring, user access monitoring, security inspection, events, metrics, Profile. For more details, please refer to the document Blacklist.
New Custom Function Menu¶
Guance supports manually setting the function menu, including hiding/displaying and adjusting the menu order, helping you customize the menu display of the current workspace. In the Guance workspace, click the account in the lower left corner, select "Function Menu Management", and you can adjust the menu. For more details, please refer to the document Function Menu Management.
New Chart Query Alias¶
In scene visualization charts, added chart query alias. After adding an alias, the name of the legend also changes accordingly, making it easier to distinguish related metrics more intuitively. Currently supported charts include time series charts, pie charts, bar charts, scatter charts, bubble charts, funnel charts.
New Time Series Chart, Pie Chart Legend Styles¶
In scene visualization charts, pie charts add support for legend styles including bottom, right, surround, and hide. Time series charts support legend styles including bottom, right, and hide. The legend style is hidden by default in the default state.
Optimized Object Historical Data Retention Policy¶
Guance stores 48 hours of historical data for all objects (hosts, containers, pods) reported to the workspace, making it convenient for users to review the data performance of objects at a certain moment in the past two days. When viewing a certain log, trace, or security inspection data, you can view the object situation at the corresponding moment in the "Attribute View" of the associated host, container, Pod.
Adjusted the Location for Saving Snapshots¶
The "Save Snapshot" button is removed from the explorer top navigation bar and uniformly migrated to "History Snapshots". Scene dashboards and notes can create snapshots through shortcuts. For more details, please refer to the document Snapshot.
Other Feature Optimizations¶
- Time series chart time interval added to millisecond level
- Management backend added workspace-level index configuration adjustment entry
- Log explorer distribution chart added support for custom selection of time interval
- RUM explorer page added current data extended field page display
DataKit Updates¶
- Some data types support caching to disk after sending failure, and sending later
- Supports sending data that meets the conditions to different workspaces through different dataway addresses
- Sourcemap adds Android and iOS support
- Container collector related updates:
- Fixed Kubernetes Node host operating system information collection error
- Prom collection in Kubernetes no longer automatically appends pod related information to avoid time series explosion
- Pod object appends corresponding yaml information
- Pipeline related updates:
- Optimized Pipeline execution steps
- grok() and json() functions perform trim-space operation by default
- DDTrace related updates:
- Fixed potential goroutine leak issue
- Supports configuring disk cache to alleviate memory usage issues
- Other Bug Fixes:
- Optimized line protocol construction
- In log collection, removed periodic cleanup of tail data function to alleviate possible log truncation issues
For more DataKit updates, please refer to DataKit Version History.
July 21, 2022¶
Guance Updates¶
New Intelligent Inspection Function¶
Intelligent inspection is based on Guance's intelligent detection algorithm, supporting automatic detection of infrastructure and application problems, helping users discover problems occurring during IT system operation, quickly locating the cause of abnormal problems through root cause analysis; through Guance's intelligent prediction algorithm, helping users foresee potential problems in infrastructure and applications in advance, assess the impact level of problems on system operation, better determine the priority of troubleshooting work, and reduce the uncertainty of the troubleshooting process.
Intelligent inspection currently supports three inspection templates:
- Memory Leak: Detects whether there is a memory leak problem on the hosts in the current workspace
- Disk Usage Rate: Detects whether the disk usage rate of the hosts in the current workspace is too high
- Application Performance Detection: Detects whether the service QPS, average response time, P90 response time, and error rate in the current workspace have fluctuating changes
Optimized Explorer Search and Filter Function¶
- Explorer search adds "not wildcard reverse fuzzy matching", that is, does not match the fuzzy query results.
-
Explorer quick filter adds workspace-level and personal-level filtering methods
-
Workspace-level filter items: Configured by administrators/owners, click the "Settings" button next to the quick filter to configure workspace-level filter items
- Personal-level filter items: All members can configure quick filter items based on the local browser, click "Edit" on the right side of the quick filter to configure personal-level filter items
For more details, please refer to the document Explorer Search and Filter.
New Modify the Time Range in the URL for Data Query¶
Guance supports directly modifying the time parameter time range of the current workspace explorer/dashboard in the browser's URL for data query, supporting 4 units: second, minute, hour, day, such as time=30s, time=20m, time=6h, time=2d, etc. As shown in the figure below, modify time=2h in the browser, the explorer displays data from the last 2 hours. For more details, please refer to the document URL Time Range.
New Dashboard View Variable Log, Application Performance, User Access, Security Inspection Data Source Configuration¶
Dashboard view variables add "Log", "Application Performance", "User Access", "Security Inspection" data source configuration, simultaneously optimizing the interaction of creating and editing view variables in the dashboard.
- In a dashboard that has not added view variables, the top navigation bar adds an "Add View Variable" button, click to add view variables
- If the dashboard has already added view variables, the view variable name will be directly displayed below the top navigation bar, click the "Edit" button on the right to enter the view variable editing page
For more details, please see the document View Variables.
Optimized Chart Query Interaction¶
- Click the "Separator Bar" between the chart and the query to drag up and down
- Click the "Drag" button on the left side of the query statement to drag up and down to adjust the order of the queries
- The scroll bar only acts on the query statement area and does not affect the chart
New User Access Metric Detection Event Notification Template Variables¶
When configuring notification events in the "User Access Metric Detection (RUM)" monitor, in addition to general template variables, the following template variables are additionally supported. For more details, please refer to the document Event Title, Content Template.
| Template Variable | Type | Description |
|---|---|---|
app_id |
String | Application ID |
app_name |
String | Application Name |
app_type |
String | Application Type |
Optimized Event Content One-Click Open Link¶
When configuring the monitor, add a link in the event content. After the event is triggered, click the link in the "Event Content" on the event details page to open a new page and jump to the page.
DataKit Updates¶
- The built-in timeout for the prom collector is 3 seconds
- Log related issue fixes:
- Added
log_read_offsetfield for log collection - Fixed the bug that log files did not readAll correctly after rotate
- Added
- Container collection related issue fixes:
- Fixed the incompatibility issue with the environment variable
NODE_NAME - k8s auto-discovered prom collector changed to serial, node decentralized collection
- Added log source and multi-line mapping configuration
- Fixed the bug that container logs still used the previous multiline and pipeline after replacing the source
- Corrected container logs, set the file active duration to 12 hours
- Optimized docker container log image field
- Optimized k8s pod object host field
- Fixed the issue that container metrics and object collection did not add host tag
- Fixed the incompatibility issue with the environment variable
- eBPF related:
- Fixed uprobe event name naming conflict issue
- Added more environment variable configurations to facilitate cloud k8s environment deployment
- Optimized APM data receiving interface data processing, alleviating client stalling and memory usage issues
- SQLServer collector fixes:
- Restored TLS1.0 support
- Supports filtering collection through instance to reduce time series consumption
- Pipeline function
adjust_timezone()has been adjusted - IO module optimization, improving overall data processing capability, keeping memory consumption relatively controllable
- Monitor update:
- Fixed possible long-term stalling caused by Monitor when busy
- Optimized Monitor display, added IO module information display, facilitating users to adjust IO module parameters
- Fixed Redis crash issue
- Removed some complicated redundant logs
- Fixed the issue that election collectors did not append host tags in non-election mode
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
- Cloud Platform Access
- Rancher Deployment DataKit Best Practices
- Tencent Cloud Product Observability Best Practices (Function)
For more best practices updates, please refer to Best Practices Version History.
July 7, 2022¶
Guance Updates¶
Optimized Explorer Positive Selection, Negative Selection, Fuzzy Matching Three Filter Modes¶
Guance explorer supports precise filtering, fuzzy filtering (wildcard) and negative selection according to the method of "field:value", and supports editing and modifying the "field:value" of the filter. For more details, please refer to the document Explorer Search and Filter.
Optimized Explorer Quick Filter¶
In the Guance explorer quick filter, select all by default, support "positive selection", "negative selection", "reset", "All" and "only" multiple filtering. If the search box includes both positive and negative selection at the same time, the corresponding quick filter label is grayed out and cannot be selected. For more details, please refer to the document Quick Filter.
New Explorer Display Column Multiple Quick Operations¶
In the Guance explorer, through "Display Columns", you can add and delete display columns to the explorer list.
- When the mouse is placed on the added display column, it supports operations such as ascending and descending order, moving the column left, moving the column right, removing the column, etc. on the added display column.
- If the content of the display column is not completely displayed, support "double-clicking the dividing line" on the right side of the display column to expand the content of the column.
Optimized Explorer Details Page Attribute/Field Quick Filter¶
On the explorer details page, when you click on attribute fields such as "Host", "Source", it supports "positive filter", "negative filter", "add to display column" and "copy" for quick filtering and viewing; on the log explorer details page, when you select an extended field, it shows "copy", "positive filter", "negative filter" and "add to display column" icons for quick filtering and viewing.
- "Copy", that is, copy the field to the clipboard
- "Positive filter", that is, add the field to the explorer to view all data related to the field
- "Negative filter", that is, add the field to the explorer to view other data except the field
- "Add to display column", that is, add the field to the explorer list for viewing
Optimized History Snapshot Function, Supports Three Time Saving Policies¶
In metrics, logs, events, application performance monitoring, user access monitoring, cloud dial test, security inspection and other explorers, you can click "Save Snapshot" to save the data content displayed by the current explorer, support choosing to turn on/off "Time Filter".
- Snapshot turns on time filter, save according to the time selected on the current page, the shared link does not support switching time widgets
- Snapshot turns on time filter and turns on lock absolute time, convert the time range selected on the current page to "absolute time" for saving
- Snapshot turns off time filter, follow the system default time, the shared snapshot can switch time widgets
After the snapshot is saved, it can be viewed in the explorer history snapshot list. Select and open the history snapshot, click "Return to Explorer" in the upper right corner to return to the default explorer.
New Pipeline One-Click Get Sample Test Data¶
When creating or editing a Pipeline, a "One-Click Get" button is added to the right of the sample parsing test to facilitate you to obtain data for cutting debugging.
- Log data: returns message data
- Other data types: returns line protocol format data
For more details, please refer to the document Text Processing Pipeline.
New Scene Custom Explorer Text Analysis Mode¶
In the scene custom explorer, when editing the explorer, in the custom configuration display column, support selecting "Approximate Text Analysis" explorer field, using the content of the message field for approximate text analysis by default. For more details, please refer to Scene Custom Explorer.
New Log Explorer Details Page Associated Network Pod and Deployment View¶
Guance supports viewing the data connection status of the host, Pod and Deployment views in the "Network" of the log explorer details page.
Note: To view the related network in the log details, the corresponding fields need to be matched.
New Explorer Details Page Associated Network 48-Hour Data Playback Function¶
In the log, process, trace, host, Pod explorer details page associated network, support clicking the time widget to select playback of 48-hour network data.
- Time range: By default, view data from 30 minutes before and after. If it is currently happening, view data from the last 1 hour by default;
- Support arbitrarily dragging the time range to view the corresponding network traffic;
- After dragging, click the "Play" button or refresh the page to return to viewing the network data of the "last 1 hour".
Adjusted Unrecovered Event Saving Policy, Supports Manual Recovery of Events¶
In the unrecovered event list, move the mouse to the event, and you can see the "Recovered" function on the right side of the event. Click "Recovered" to recover the event, and at the same time, a recovered event will be generated, and the corresponding operator can be viewed in the event.
Other Feature Optimizations¶
- Chart lock time adds [Latest 5 minutes] time range, time interval adds [5s] [10s] [30s] three second-level time choices
- Scene explorer display column, view variable basic object field attributes/tags support custom input
- Adjusted event details associated dashboard position
- Added time series count statistics in metric management
- Optimized log details page associated trace, display flame graph and select all data of the corresponding span according to the trace_id and span_id in the log
- Optimized user access monitoring service display and interaction
- RUM, network, availability monitoring, CI explorer drop-down options adjusted to tiled display
- Monitor event notification content supports configuring template variable field mapping, supports configuring template variable corresponding values through DQL query statements.
- Help center homepage adds key function quick jump entry
DataKit Updates¶
- Adjusted the behavior of global tags to avoid tag splitting in election collectors
- SQLServer collector adds election support
- Line protocol filter supports all data types
- 9529 HTTP service adds timeout mechanism
- MySQL
- dbm measurement set name adjustment
- service field conflict issue
- Container object adds field container_runtime_name to distinguish container names at different levels
- Redis adjusts slowlog collection, changes its data to log storage
- Improved TDEngine collection
- Improved Containerd log collection
- Pipeline adds Profile class data support
- Container/Pod log collection supports additionally appending tags on Label/Annotation
- Fixed Jenkins CI data collection time accuracy issue
- Fixed Tracing resource-type value inconsistency issue
- eBPF adds HTTPS support
- Fixed possible log collector crash issue
- Fixed prom collector leak issue
- Supports configuring io disk cache through environment variables
- Added Kubernetes CRD support
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
- Access Integration
- Skywalking Collection JVM Observability Best Practices
- Minio Observability Best Practices
For more best practices updates, please refer to Best Practices Version History.
June 21, 2022¶
Guance Updates¶
Guance Help Documentation Fully Online¶
To improve the reading experience of the Guance help documentation, the Guance help documentation has been moved under the Guance domain name, allowing you to view the Guance help documentation more simply and quickly. The new Guance help documentation address is: https://docs.guance.com/en.
New Profile Observability¶
Profile supports collecting dynamic performance data during application runtime in different language environments such as Java / Python, helping users view CPU, memory, IO performance issues. Collecting profile data requires first installing DataKit and configuring the Profile collector. After configuration, DataKit will upload the collected profile data to your Guance workspace. You can understand your program code performance through the Profile real-time data viewer. For more details, please refer to the document Profile.
Pipeline Covers Full Data Text Analysis Processing¶
Text processing (Pipeline) is used for data parsing. By defining parsing rules, various data types are cut into structured data that meets our requirements. In the Guance workspace "Management" - "Text Processing (Pipeline)", click "New Pipeline" to create a new pipeline file. For more details, please refer to the document Text Processing (Pipeline).
New Deployment Network Details and Network Distribution¶
Deployment network supports viewing network traffic between Deployments. Supports viewing network traffic trafficand data connections between source IP and target IP based on IP/port, displaying in real-time through visualization, helping enterprises understand the network operation status of business systems in real time, quickly analyze, track, and locate problem faults, and prevent or avoid business problems caused by declining network performance or interruption.
Deployment network data, after successful collection, is reported to the Guance console. You can view the network performance monitoring data information of all Deployments within the workspace in the "Network" section of the "Infrastructure" - "Containers" - "Deployment" details page. For more details, please refer to Deployment Network.
Optimized Event Detection Dimension Jump to Other Explorers¶
In the unrecovered event explorer, support clicking the detection dimension to view related containers, processes, logs, traces, RUM, availability detection, security inspection, CI, etc. If the related explorer has no relevant data, the corresponding jump link is grayed out and unclickable.
New Log Explorer JSON Format Message Information Search¶
The log explorer adds search for JSON format log content (message). The search format is: @key.key:value.
Note: JSON search is only supported for workspaces created after this feature was launched.
New User Access Monitoring Support for Custom app_id Input When Creating Applications¶
Added custom application ID function. Supports custom input of app_id when using User Access Monitoring to create a new application, generating a unique application ID identifier within the current space, which can be used to distinguish application types, data upload matching, etc.
- Application Name (Required): Used to identify the name of the application currently implementing User Access Monitoring.
- Application ID Identifier (Optional): A unique application ID identifier within the current space, supports customization, and can be used for data upload matching; the application ID identifier can be up to 20 characters, only uppercase and lowercase letters are supported.
- Application ID: If an application ID identifier is filled in when creating the application, after clicking create application, the filled application ID identifier information will be prepended to the application ID, making it easier to distinguish applications for query and filtering.
Optimized Process Detection to Infrastructure Object Detection¶
Process detection is optimized to Infrastructure Object Monitoring, adding basic object selections such as Host, Container, Process, Pod, Deployment, Replicaset, Job, Resource Catalog, etc., used to monitor infrastructure object data within the workspace. For more details, please refer to the document Infrastructure Object Detection.
Other Feature Optimizations¶
- Infrastructure POD explorer honeycomb mode adds CPU usage rate, memory usage fill metrics
- Optimized log blacklist configuration. Supports manual input of log sources as sources for log blacklists
- Optimized Application Performance Monitoring service list data query time component, supports custom time range selection
- Optimized the guide text for installing DataKit on K8S, automatically adding the current workspace token in the configured DataWay data gateway address
- Optimized monitor configuration UI style
DataKit Updates (2022/5/12)¶
- gitrepo supports passwordless mode
- prom collector
- Supports log mode collection
- Supports configuring HTTP request headers
- Supports container log collection exceeding 16KB in length
- Supports TDEngine collector
- Pipeline
- Supports XML parsing
- Remote debugging supports multiple data types
- Supports Pipeline calling external Pipeline scripts via the
use()function
- Added IP library (MaxMindIP) support
- Added DDTrace Profile integration
- Containerd log collection supports configuring filtering rules via image and K8s Annotation
- Documentation library overall switch
2022/06/16¶
- Log collection supports recording collection position to avoid data omission due to DataKit restart, etc.
- Adjusted Pipeline settings when processing different types of data
- Supports receiving SkyWalking metric data
- Optimized log blacklist debugging function:
- The number of filtered points will be displayed in the Monitor
- A .filter file will be added in the datakit/data directory to record the fetched filters
- Monitor adds DataKit open file count display
- DataKit compiler upgraded to golang 1.18.3
2022/06/07¶
- Added TCP/UDP port detection collector
- DataKit and DataWay之间增加 DNS 检测,支持 DataWay DNS 动态切换 (Added DNS detection between DataKit and DataWay, supporting DataWay DNS dynamic switching)
- eBPF L4/L7 traffic data adds k8s deployment name field
- Optimized OpenTelemetry metric data
- ElasticSearch adds AWS OpenSearch support
- Line protocol limit, string length limit relaxed to 32MB
- prom collector adds additional configuration, supports ignoring specified tag=value matches to reduce unnecessary time series
- Sink adds Jaeger support
- Kubernetes related metric collection is turned off by default to avoid time series explosion issues
- DataKit Monitor adds dynamically discovered (e.g., prom) collector list refresh
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
- APM
- GraalVM and Spring Native Project Achieving Trace Observability
- Access Integration
- Host Observability Best Practices (Linux)
For more best practices updates, please refer to Best Practices Version History.
Integration Template Updates¶
New Documents¶
- Alibaba Cloud
- Alibaba Cloud NAT
- Alibaba Cloud CDN
New Views¶
- Alibaba Cloud
- Alibaba Cloud NAT
- Alibaba Cloud CDN
June 6, 2022¶
Guance Billing Update¶
Guance billing optimizes the time series billing logic and the data retention policy for metric data. The original 3 yuan per 300 time series was reduced to 3 yuan per 1000 time series. Meanwhile, metric data adds 3-day, 7-day, 14-day data retention policies, and measurements support custom data retention policies.
Time series statistics refer to the total number of all combinations that can be formed based on tags in the metric data reported to the current workspace. The data retention policy, i.e., data storage duration, is the storage time of data reported to the current workspace. Data exceeding the storage duration will be automatically deleted.
Time series are counted in full, meaning the daily generated time series will accumulate sequentially during the data retention policy period. The longer the data retention policy, the higher the time series cost.
This optimization, by reducing the time series cost and shortening the metric data retention policy, allows you to more flexibly adjust the metric data retention policy, helping you save costs.
Guance Updates¶
New Jenkins CI Observability¶
Guance adds Jenkins CI observability. You can directly view the CI results in Jenkins through the CI visualization function of Guance. The CI process is continuous integration. When developers push code and encounter problems, they can view all CI pipelines, their success rates, failure reasons, and specific failure links in Guance, helping you provide code update guarantees. For more details, please refer to CI Visualization.
New Custom Explorer Chart Synchronized Search¶
Custom explorers add a chart synchronized search switch, used to determine whether the search conditions affect the chart query. It is enabled by default. When the search box has content, turning off the switch means the chart query returns to the default state; turning on the switch means the chart query is affected by the filter content.
New Network Topology and Service Topology Drill-down Analysis¶
In the infrastructure network topology map, click the host/Pod icon, click "View Upstream/Downstream" to view the current node's associated upstream and downstream nodes. In the upstream/downstream nodes, click "Return to Overview" in the upper left corner to return to the original network topology map. Searching or filtering in the search box can filter associated upstream/downstream nodes, and the matching associated upstream/downstream nodes are displayed based on the search or filter results. For more details, please refer to the document Network View Upstream/Downstream.
In the service topology map, click the service icon, click "View Upstream/Downstream" to view the current service's associated upstream and downstream services. In the upstream/downstream services, click "Return to Overview" in the upper left corner to return to the original service topology map. Searching or filtering in the search box can filter associated upstream/downstream services, and the matching associated upstream/downstream services are displayed based on the search or filter results. For more details, please refer to the document Service View Upstream/Downstream.
New Delete Resource Directory Data and Index¶
Guance supports owners and administrators deleting specified resource directory classifications and all resource directories. Enter "Management" - "Basic Settings", click "Delete Resource Directory", then choose the method to delete the resource directory to delete the corresponding object data.
- Specified Resource Directory Classification: Only deletes data under the selected object classification, does not delete the index
- All Resource Directories: Deletes all resource directory data and index
Note: Once all resource directories are deleted, all data and indexes of custom infrastructure uploaded will be deleted and cannot be recovered. All set resource directory classification data needs to be re-uploaded. Deleting all resource directories is limited to 5 times per day.
For more introduction to resource directories, please refer to the document Custom.
New Explorer Snapshot View Entry¶
In Guance metrics, logs, events, application performance monitoring, user access monitoring, cloud dial test, security inspection, CI visualization and other explorers, after saving a snapshot, you can directly click the view snapshot icon in the upper right corner to slide out and view the saved snapshots.
- Supports snapshot name keyword search, fuzzy matching related snapshot names through keywords
- The first one is the default view and does not support sharing, copying links, or deleting functions
- Except for the first default view, other snapshots support sharing, copying links, and deleting functions. Click the "Snapshot Name" to open the corresponding data copy in the current explorer
New Explorer Filter Condition Editing Function¶
In the Guance explorer search bar, when filtering by "field:value", support clicking "field:value" to edit and modify the "field:value", and filter with the modified result.
Optimized User Access View Explorer Associated Trace to Fetch/XHR¶
In the Guance User Access Monitoring View explorer details page, when switching to "Fetch/XHR", it supports viewing every network request made by the user to the backend application during access, including occurrence time, requested trace, and duration.
If the network request has a corresponding trace_id, there will be a prompt icon before the request. Click the request to jump to the details page of the corresponding trace.
New Chart Data Loading High Performance Mode¶
Guance supports chart data loading high performance mode, which is turned off by default. It can be enabled by clicking the account in the lower left corner and selecting "High Performance Mode". After enabling high performance mode, all charts are not loaded dynamically; they are all loaded directly when clicking into the page, meaning when you scroll down beyond the current page usage, the charts are already fully loaded and can be viewed directly.
Note: Enabling high performance mode only affects the current user viewing charts.
New Alert Configuration Event Notification Level¶
Alert configuration supports custom selection of event notification levels, including Emergency, Important, Warning, Recovery, Data Gap, Data Gap Recovery, Data Gap Considered as Recovery 7 options, supports multiple selections, supports one-click clearing of options. After clearing, options are cleared and need to be manually selected again. For more alert configuration details, please refer to the document Alert Settings.
Other Feature Optimizations¶
- Scene dashboard combination chart supports hiding/displaying the large title
- Optimized event details page event type text display
- Infrastructure list view adds sorting by field function
- Log explorer adds hide distribution chart button
- Explorer supports searching display columns by keyword, supports custom display columns as preset fields. Subsequent data can be displayed directly after being cut and uploaded by Pipeline.
- Added one-click viewing of corresponding integration documents in the built-in template library and built-in views to help you quickly configure the corresponding collector
- Built-in views, in addition to supporting binding trace services, applications, log sources, projects, labels and other related views in the explorer, now support custom key and value binding related views, and also support service slide details page binding built-in views
- Optimized notification object Lark robot, supports custom whether key security verification is needed
- When configuring a monitor, if the configured data range is smaller than the detection frequency, a prompt will be triggered indicating that there may be data gap issues
DataKit Updates (2022/05/26)¶
- Pipeline adjustments: All data types can be additionally processed by configuring Pipeline
- grok() supports directly extracting fields as specified types, no longer needing additional type conversion via the cast() function
- Pipeline adds multi-line string support, improving readability for very long strings (e.g., regular expressions in grok切割)
- The running status of each Pipeline can be viewed directly via
datakit monitor -V - Added Kubernetes Pod object CPU/memory metrics
- Helm adds more Kubernetes version installation adaptations
- Optimized OpenTelemetry, HTTP protocol adds JSON support
- DataKit增加了对行协议纠错行为的日志记录,便于调试数据问题 (DataKit adds logging for line protocol correction behavior to facilitate debugging data issues)
- Removed all string metrics from time series data
- In DaemonSet installation, if an election namespace is configured, specific tags (election_namespace) will be added to the data of participating election collectors
- CI observability adds Jenkins support
Breaking changes¶
For Docker container log collection, the host (Node) path /varl/lib needs to be mounted into DataKit (because Docker logs default to the host's /var/lib/). In datakit.yaml, add the following configuration to volumeMounts and volumes:
volumeMounts:
- mountPath: /var/lib
name: lib
# Omit other parts...
volumes:
- hostPath:
path: /var/lib
name: lib
Best Practices Updates¶
- APM
- Based on Guance, using SkyWalking to achieve RUM, APM and log linkage analysis
- Monitoring Best Practices
- OpenTelemetry Observability Construction
- OpenTelemetry to Jaeger, Grafana, ELK
- OpenTelemetry to Grafana
- OpenTelemetry to Guance
- Guance Tips
- OpenTelemetry Sampling Best Practices
Integration Template Updates¶
New Documents and Views¶
- Data Collection
- Opentelemetry Collector
- Container Orchestration
- Kubernetes Scheduler
- Kubernetes Controller Manager
- Kubernetes API Server
- Kubernetes Kubelet
New Views¶
- Container Orchestration
- Kubernetes Nodes Overview
- Middleware
- JVM Kubernetes
May 19, 2022¶
Guance Updates¶
Optimized Guance Commercial Version Registration Process¶
When registering for Guance, it provides Free Plan and Commercial Plan registration options. You can register for a Guance account according to your actual needs. Guance supports pay-as-you-go, usage-based billing, providing you with an out-of-the-box cloud platform for comprehensive observability.
New Scene Dashboard User View Template Library¶
Guance has over 60 built-in system view templates, ready to use without configuration, meeting your various monitoring scene needs. You can customize views as user view templates to create dashboards with one click.
New Scene Custom Explorer Log Source and Filter Linkage¶
The data range of scene custom explorers adds a filter function. Based on the log source, filter the field data under that log source, supports matching multiple field data. Combined with search and grouping, it can further filter log data.
New Event Details Page Content Copy as Json Format¶
On the event details page, support clicking the "Copy Complete Event Content" button to obtain all key data corresponding to the current event. If a dashboard is associated when configuring the monitor, you can click the "Associated Dashboard" button to jump to the corresponding dashboard.
In the Associated Information of the event details page, if "Log Detection" configures multiple expression queries (same object type), the associated information supports tab switching for multiple expression queries. If there are two expression queries A and B, then the relationship information contains two tabs, A and B, for switching and viewing.
New Log Data Desensitization Processing¶
Guance adds log data desensitization processing. After data is collected and reported to the Guance workspace, some data may contain sensitive information, such as IP addresses, user information, etc. For this part of information, desensitization processing can be done by configuring sensitive fields.
Note:
- Desensitized data can only be viewed by workspace administrators and above members. Standard and read-only members cannot view desensitized information.
- Configuring sensitive fields is only supported by workspace administrators and above members. Standard and read-only members can only view configured sensitive fields.
Optimized Log Explorer and Details Page¶
Guance Log Explorer displays "time" and "message" fields by default. This optimization supports hiding the "message" field display.
On the log details page, the log content automatically displays Json and text两种 viewing modes based on the message type. If the log has no message field, the log content part is not displayed. The log content supports expand/collapse, is expanded by default, and only shows 1 line height when collapsed.
Extended fields display all related fields of the log, supporting "Copy" and "Add to Filter" for quick filtering and viewing.
New Network Data Detection Monitor¶
Network Data Detection is used to monitor the metric data of network performance within the workspace. By setting threshold ranges, alerts are triggered when metrics reach the threshold. "Guance" supports setting alerts for a single metric and custom alert levels. In Monitors, click "+New Monitor", select "Network Data Detection", and enter the detection rule configuration page.
Optimized Built-in View Binding Function¶
Guance built-in views include system views and user views. This optimization cancels custom binding system views as explorer views, only supports binding user views as explorer views. If you need to bind system views, you can first clone the system view as a user view. If system views and user views have the same name, the user view is prioritized displayed in the explorer. For how to bind user views as explorer views, please refer to the document Bind Built-in Views.
Other Feature Optimizations¶
- Billing Plan and Bill added prepaid card balance
- Infrastructure details style optimization
- Trace details page attribute line break display optimization
- Monitor configuration template variable display optimization
- Added shortcut entry, DQL query and snapshot menu moved under shortcut entry
- Guance management backend supplemented template management classification information
DataKit Updates (2022/5/12)¶
- eBPF adds arm64 support
- Line protocol construction supports automatic error correction
- DataKit main configuration adds example configuration
- Prometheus Remote Write supports tag renaming
- Merged community版 DataKit existing functions, mainly including Sinker function and filebeat collector
- Adjusted container log collection, DataKit directly supports containerd container stdout/stderr log collection
- Adjusted hostname acquisition strategy in DaemonSet mode
- Trace collector supports filtering resources (
resource) through service name (service) wildcard
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
- Cloud Native
- Using Guance to Start Your Rancher Observability Journey
- Microservice Observability Best Practices
- Kubernetes Cluster Application Using SkyWalking to Collect Trace Data
- Kubernetes Cluster Log Reporting to Same Node DataKit Best Practices
- Gitlab-CI Observability Best Practices
- Gitlab-CI Observability Best Practices
For more best practices updates, please refer to Best Practices Version History.
Integration Template Updates¶
New Documents and Views¶
- Middleware
- Resin
- Beats
- Host System
- Procstat
New Views¶
- Container Orchestration
- Istio Service
- Alibaba Cloud
- ASM Service
May 6, 2022¶
Guance Updates¶
Optimized Guance Commercial Version Upgrade Process¶
Guance upgrade to commercial version默认开通 Guance Billing Center Account Settlement, supports changing the settlement method to cloud account settlement, including Alibaba Cloud Account and AWS Cloud Account settlement methods.
New Process, Log, Trace Details Page Associated Network¶
Guance Process, Log, Trace details page新增关联网络数据分析,支持基于 IP/端口查看源主机/源进程服务到目标之间的网络流量和数据连接情况,通过可视化的方式进行实时展示,帮助企业实时了解业务系统的网络运行状态,快速分析、追踪和定位问题故障,预防或避免因网络性能下降或中断而导致的业务问题。
Scene Module Optimization¶
Optimized Dashboard, Removed Edit Mode¶
In the scene Dashboard top navigation bar, removed the "Edit" button, added "Add Chart" to add new charts to the dashboard. After adding charts, click "Finish Adding" in the upper right corner.
In Charts, click the "Settings" button, select "Modify", to edit the chart.
New Chart Link Display Switch¶
Guance supports built-in chart links and custom links, helping you jump from the current chart to the target page. Built-in links are association links provided by Guance by default for charts, mainly based on the current query time range and grouping labels, helping you view corresponding logs, processes, containers, traces. The built-in link display switch is off by default and can be enabled when editing the chart; custom links are enabled by default after creation. For more details, please refer to the document Chart Links.
Optimized DQL Query and Simple Query Conversion¶
Click the switch button on the right side of DQL Query to switch DQL query to simple query.
Note: When DQL Query is switched to Simple Query, if it cannot be parsed or the parsing is incomplete:
- If no operation is performed under Simple Query, switching back to DQL Query will display the previous DQL query statement;
- If the query statement is adjusted under Simple Query, switching back to DQL Query will parse according to the latest Simple Query.
For more applications of DQL query and simple query, please refer to the document Chart Query.
Monitor and Event Module Optimization¶
New Event Association Information¶
Guance supports viewing information related to the event that triggered the current event, such as viewing logs related to the triggered event. This Association Information only supports events generated by 4 types of monitors: log detection, security inspection anomaly detection, process anomaly detection, and availability data detection.
New Data Gap Event Name and Content Configuration¶
Guance monitors "Threshold Detection", "Water Level Detection", "Range Detection", "Mutation Detection", "Process Anomaly Detection", "Application Performance Metric Detection", "User Access Metric Detection" add data gap event title and content configuration. It is not fillable by default. When "Trigger Data Gap Event" is selected, the data gap event name becomes fillable, supporting the use of preset template variables. For details, refer to Event Title/Content Template.
Optimized Availability Data Detection¶
Guance monitor Availability Data Detection优化支持选择 HTTP、TCP、ICMP、WEBSOCKET 拨测类型。
Optimized Alert Notification Template, Added Association Jump Link¶
Email, DingTalk, WeChat, Lark received alert notifications contain "Guance jump link". Clicking can directly jump to the corresponding Guance event details. The time range is fixed to 15 minutes before the current time, i.e., an event at 18:45:00, after clicking the link, jumps to the event details page with a fixed time range of 4.20 18:30:00 ~ 4.20 18:45:00. For more alert notification details, please refer to the document Alert Settings.
Other Feature Optimizations¶
- Optimized service servicemap metric query performance
- Added explorer numerical field support for > | >= | < | <= | [] 5 writing methods
- Added metric explorer tag support for cascading filtering
- Optimized DQL query return error prompt
DataKit Updates¶
- Process collector filtering function only works on metric collection, object collection is not affected
- Optimized DataKit sending DataWay timeout issue
- Optimized Gitlab collector
- Fixed log collection truncation issue
- Fixed various trace collectors reload后部分配置不生效的问题 (Fixed the issue that some configurations did not take effect after reload for various trace collectors)
For more DataKit updates, please refer to DataKit Version History.
Integration Template Updates¶
New Data Storage Redis Sentinel Integration Document and View¶
Redis-sentinel observation scene mainly displays Redis cluster, slaves, node distribution information, etc.
April 26, 2022¶
Guance Updates¶
Optimized SSO Single Sign-On¶
Guance supports enabling SSO single sign-on based on the workspace. Users can obtain the corresponding SSO login by entering the company email during login, achieving corresponding verification login. In the Guance workspace "Management" - "SSO Management" - "Enable", you can set up SSO single sign-on for employees. The main optimizations of this update include the following points:
- A workspace is updated from supporting the creation of multiple identity providers to only supporting the configuration of one SSO single sign-on. The SAML2.0 configuration you last updated will be regarded as the final single sign-on verification entry by default.
- If multiple workspaces configure the same identity provider (IdP) data, you can switch to view the data of the corresponding workspace after logging in through SSO single sign-on.
- When configuring SSO single sign-on, the "User Whitelist" configuration is replaced with "Email Domain". You only need to configure the suffix domain of the email to match the user email suffix configured by the identity provider (IdP) for single sign-on.
- SSO single sign-on enablement, configuration updates, deletion support email notification and generate audit events.
- SSO single sign-on users support deletion and editing. When editing, permissions can be upgraded to "Administrator".
For more SSO single sign-on details, please refer to the document SSO Management.
April 21, 2022¶
Guance Community Edition Online¶
The Guance Community Edition provides teachers, students, cloud computing enthusiasts and other community users with a simple, easy-to-obtain yet fully functional localized deployment platform. Welcome to apply for free download and trial, build your own Guance platform, and experience the complete product features.
Guance Updates¶
New Gitlab CI Observability¶
Guance supports visualization of the process and results of CI built into Gitlab. You can directly view the CI results in Gitlab through the CI visualization function of Guance. The CI process is continuous integration. When developers push code and encounter problems, they can view all CI pipelines, their success rates, failure reasons, and specific failure links in Guance, helping you provide code update guarantees. For more details, please refer to CI Explorer.
New Online Help Obus Assistant¶
The Guance Obus Assistant supports you to quickly view basic入门, advanced guides, best practices, DataKit, Func and other documents in the workspace. By clicking the provided keywords or directly entering keywords in the search bar for search, it helps you quickly obtain related document instructions. For more details, please refer to the document Help.
New Dashboard Set Refresh Frequency¶
Guance supports setting the refresh frequency on the scene dashboard. The initial refresh frequency setting defaults to 30 seconds, supporting 10 seconds, 30 seconds, 60 seconds三种选项. If the time widget is "Paused", it will not refresh.
New Process 48-Hour Playback¶
Guance infrastructure processes support viewing process data collected within the last ten minutes. Click the time range to view process 48-hour playback. After dragging, the refresh pauses, and the time displays as: [ Start Time - End Time ]. The query time range is 5 minutes. Click the "Play" button or refresh the page to return to viewing the processes from the "last 10 minutes".
New Integration DataKit Kubernetes(Helm) Installation Guide Page¶
In the Guance Integration DataKit installation guide page, added Kubernetes(Helm) installation guide, introducing how to use Helm to install DataKit in K8S.
New Application Performance Global Overview, Service Classification Filter, Service Topology Map Distinguishing Environment and Version¶
Application Performance Monitoring adds a global performance overview view. You can view the number of online services, P90 service response time, service maximum impact time, service error count, service error rate statistics on the overview page. You can also view the Top10 rankings of P90 service, resource, operation response time, as well as the Top10 rankings of service error rate, resource 5xx error rate, resource 4xx error rate.
In the Application Performance service list,支持您通过点击服务类型图标进行分类筛选,再次点击即可恢复全部查看。(supports you to filter by category by clicking the service type icon. Click again to restore viewing all.)
In the service list, switching to "Topology Map" mode allows you to view the calling relationships between various services. Supports drawing service topology maps based on two dimensions: service (service) and service environment version (service+env+version). After enabling "Distinguish Environment and Version", the service topology map will be drawn according to different environment versions. For example, canary release. By enabling environment and version, you can view the service调用情况 under different environment versions.
Optimized SSO Single Sign-On Configuration¶
SSO single sign-on configuration user whitelist adjusted to email domain, used to verify whether the email suffix entered at the single sign-on matches. Matching emails can obtain the SSO login link online. For more SSO configuration details, please refer to the document SSO Management.
Other Feature Optimizations¶
- Added trace details page associated log "All Sources" option
- Added metric filter support for negative selection, aggregation function position adjustment
- Optimized log, application performance, user access, security inspection generate metrics, the selected time for "Frequency" is also used as the aggregation cycle
- Optimized Guance deployment version workspace owner transfer permission function canceled, supports management backend settings
- Optimized alert notification SMS template
- Optimized availability monitoring new dial test list, supports directly selecting HTTP, TCP, ICMP, WEBSOCKET dial test
DataKit Updates¶
- Pipeline module fixed Grok dynamic multi-line pattern issue
- DaemonSet optimized Helm installation, added environment variable configuration to enable pprof. All default enabled collectors in DaemonSet support configuration through environment variables.
- Tracing collector初步支持 Pipeline 数据处理。(Preliminarily supports Pipeline data processing.)
- Dial test collector adds failure task exit mechanism
- Logs added
unknownlevel (status). Logs without a specified level are allunknown - Container collector fixes:
- Fixed cluster field naming issue
- Fixed namespace field naming issue
- In container log collection, if the Pod Annotation does not specify the log
source, then DataKit will derive the log source according to this priority - Object reporting is no longer subject to the 32KB length limit (because Annotation content exceeds 32KB). All Kubernetes objects have the
annotationfield removed.
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
- Microservice Observability Best Practices
- Service Mesh Microservice Architecture from Development to Canary Release Full Process Best Practices (Part 1)
- Service Mesh Microservice Architecture from Development to Canary Release Full Process Best Practices (Part 2)
- Service Mesh Microservice Architecture from Development to Canary Release Full Process Best Practices (Part 3)
- Monitoring Best Practices
- JAVA OOM Exception Observability Best Practices
For more best practices updates, please refer to Best Practices Version History.
Integration Template Updates¶
New Documents¶
- Application Performance Monitoring (APM)
- Node.JS
- Ruby
- Middleware
- RocketMQ
- Container Orchestration
- Istio
- Kube State Metrics
- Data Storage
- Aerospike
New Views¶
- Container Orchestration
- Kubernetes Overview by Pods
- Istio Mesh
- Istio Control Plane
- Alibaba Cloud
- Alibaba Cloud ASM Mesh
- Alibaba Cloud ASM Control Plane
- Alibaba Cloud ASM Workload
- Middleware
- RocketMQ
April 8, 2022¶
Guance Billing Update¶
New Alibaba Cloud Account Settlement Method¶
In the Guance Billing Center "Manage Workspace", you can view all workspaces bound under the account, and supports modifying the settlement method of the bound workspaces. Through "Change Settlement Method", you can arbitrarily switch between Guance Billing Center account, Amazon Cloud account, and Alibaba Cloud account settlement.
Guance Updates¶
New DQL Query Explorer¶
DQL is a language developed specifically for Guance. Its syntax is simple and easy to use. You can perform data queries using the DQL language in the Guance workspace or on terminal devices.
In the Guance workspace, click the menu bar "DQL Query" to open the DQL query explorer, or you can directly open DQL query with the shortcut key Alt+Q. The DQL query explorer supports table and JSON两种 return results, supports saving 7 days of historical query records.
Availability Monitoring New TCP/ICMP/Websocket Dial Test Protocol¶
Guance supports custom dial test tasks. By creating dial test tasks based on different protocols such as HTTP, TCP, ICMP, WEBSOCKET, etc., it comprehensively monitors the network performance, network quality, network data transmission stability, etc., from different regions and different operators to various services.
New Infrastructure Network Module¶
In Infrastructure, the original host network Map and Pod network Map are moved from the host and container modules to the new module "Network", supporting viewing host and Pod network Maps. Pod network Map fill metrics新增七层网络指标:每秒请求数、错误率以及平均响应时间。(added L7 network metrics: requests per second, error rate, and average response time.) For more details, please refer to the document Network.
Infrastructure Container Pod New HTTP L7 Network Data Display¶
Infrastructure container Pod adds HTTP L7 network data collection and display. Pod network data, after successful collection, is reported to the Guance workspace. In the "Network" section of the "Infrastructure" - "Containers" - "Pod" details page, you can view the network performance monitoring data information of all Pods within the workspace. For more basic details, please refer to the document Pod Network.
New Explorer Quick Filter "Negative Selection" and "Reset" Functions¶
In any explorer's "Quick Filter",支持在选择字段筛选内容时进行“反选”或者“重置”,“反选”表示选中的字段筛选内容不展示(再次点击“反选”可返回字段选中状态),“重置”可清空筛选条件。(supports "negative selection" or "reset" when selecting field filter content. "Negative selection" means the selected field filter content is not displayed (click "negative selection" again to return to the field selected state). "Reset" can clear the filter conditions.) For more details, please refer to *Log Explorer Quick Filter *.
Optimized Log Blacklist¶
Guance supports filtering out logs that meet the conditions by setting a log blacklist. That is, after configuring the log blacklist, log data that meets the conditions will no longer be reported to the Guance workspace, helping users save log data storage costs.
The log blacklist optimization supports matching all log sources and supports two blacklist configuration methods:
- Meet any one filter condition, trigger blacklist filtering
- Meet all filter conditions, trigger blacklist filtering
For more blacklist configuration details, please refer to the document Log Blacklist.
Other Feature Optimizations¶
- Added trace details page span count statistics
- Optimized trace associated host time series drawing method
- Optimized overview chart time slicing, canceled the option. If the overview chart had time slicing enabled before optimization, it defaults to not enabling time slicing after optimization.
- Optimized combination chart in browser zoom situation, unable to switch and edit different chart queries after entering edit mode
- Optimized log explorer manual pause page refresh后,滚轴滑动到顶部不触发自动刷新。(after manually pausing page refresh in the log explorer, scrolling to the top does not trigger automatic refresh.)
DataKit Updates¶
- Added host runtime memory limit, memory limit configuration is supported during the installation phase.
- CPU collector adds load5s metric
- Supports Guance optimized log blacklist function, adjusted monitor layout, added blacklist filtering situation display
- DaemonSet installation adds Helm support, added DaemonSet installation best practices
- eBPF adds HTTP protocol collection. During host installation, the eBPF collector is not installed by default. If installation is needed, specific installation instructions must be used. DaemonSet installation is not affected.
For more DataKit updates, please refer to DataKit Version History.
Guance Mobile APP Update¶
Added site login capability, optimized scene and event explorers, maintaining the same access experience as the web version explorers.
- Supports users selecting the site corresponding to the account and logging in via account password or verification code.
- Supports users viewing log data from all sources or any source
- Supports users viewing all dashboards under the current space, and switching "All Dashboards", "My Favorites", "Import Project", "My Creation" and "Frequently Browsed" through the drop-down menu to quickly filter and find the corresponding dashboard.
- Supports users in the Event explorer to view, search, and filter all unrecovered events triggered by the anomaly detection library through "All"; through "My" events, view events that have been notified to the user via email, DingTalk robot, Enterprise WeChat robot, Webhook, etc., and are still unrecovered.
Best Practices Updates¶
- Guance Tips
- Multi-microservice Project Performance Observability Practice
- ddtrace Advanced Usage
- Kubernetes Cluster Using ExternalName to Map DataKit Service
- Access (Integration) Best Practices
- OpenTelemetry Trace Data Access Best Practices
- Microservice Observability Best Practices
- Alibaba Cloud ASM-based Microservice Observability Best Practices
For more best practices updates, please refer to Best Practices Version History.
Integration Template Updates¶
New Alibaba Cloud PolarDB Oracle Integration Document, View and Monitor¶
Alibaba Cloud PolarDB Oracle metric display, including CPU usage, memory usage, network traffic, connections, IOPS, TPS, data disk size, etc.
New Alibaba Cloud PolarDB PostgreSQL Integration Document, View and Monitor¶
Alibaba Cloud PolarDB PostgreSQL metric display, including CPU usage, memory usage, network traffic, connections, IOPS, TPS, data disk size, etc.
New Alibaba Cloud RDS SQLServer Integration Document, View and Detection Library¶
Alibaba Cloud RDS SQLServer metric display, including CPU usage, disk usage, IOPS, network bandwidth, TPS, QPS, etc.
New DataKit Integration Document, View and Monitor¶
DataKit performance metric display, including CPU usage, memory information, running time, log records, etc.
New Nacos Integration Document, View¶
Nacos performance metric display: Nacos online time, Nacos config long connections, Nacos config configuration count, Service Count, http request count, etc.
March 24, 2022¶
Guance Site Update¶
Guance supports multi-site login and registration, added "Overseas Region 1 (Oregon)" site. The original "China Region 1 (Alibaba Cloud)" was changed to "China Region 1 (Hangzhou)", and the original "China Region 2 (AWS)" was changed to "China Region 2 (Ningxia)".
Accounts and data from different sites are independent and cannot be shared or migrated between each other. You can choose the appropriate site for registration and login based on the resources you use. Currently, Guance supports the following three sites. For how to choose a site, please refer to the document Guance Site Instructions.
| Site | Login URL | Operator |
|---|---|---|
| China Region 1 (Hangzhou) | https://auth.guance.com/ | Alibaba Cloud (China Hangzhou) |
| China Region 2 (Ningxia) | https://aws-auth.guance.com/ | AWS (China Ningxia) |
| Overseas Region 1 (Oregon) | https://us1-auth.guance.com/ | AWS (US Oregon) |
Guance Updates¶
New Workspace Data Authorization¶
Guance supports authorizing data from multiple workspaces to the current workspace through data authorization, and querying and displaying it through scene dashboard and note chart components. If you have multiple workspaces, after configuring data authorization, you can view the data of all workspaces in one workspace. For more configuration details, please refer to the document Data Authorization.
-
Configure the workspaces whose data needs to be authorized for viewing in "Management" - "Data Authorization"
-
After the workspace obtains data authorization, open "Scenes" - "Dashboard or Note", select the chart component, choose the workspace authorized for viewing in the "Workspace" under "Settings", and then you can view and analyze the data of the authorized workspace through Chart Query.
New Save Online Pipeline Sample Test Data¶
Guance Pipeline supports custom and official library两种:
- After writing the custom Pipeline script rules, you can input log sample data for testing to verify whether your configured parsing rules are correct. After saving the custom Pipeline, the log sample test data is saved synchronously.
- The Pipeline official library comes with multiple log sample test data. You can choose the log sample test data that meets your needs before "cloning". After the cloned Pipeline is modified and saved, the log sample test data is saved synchronously.
For more online Pipeline function details, please refer to the document Pipelines.
Optimized Resource Directory Explorer¶
In the Guance workspace, through "Infrastructure" - "Custom" - "Add Object Classification", you can create new object classifications, and resource directory classification names and object fields.
After adding the resource directory classification, you can上报自定义数据 via the Func Function Processing Platform. For how to report data to the Guance workspace through Func, please refer to the document Resource Directory Data Reporting.
Optimized Snapshot Sharing Supports Permanently Valid Links¶
Snapshot sharing supports setting validity time, supporting选择 “48 小时”或者“永久有效”.(choosing "48 hours" or "Permanent".) In the snapshot list, click the share button, and you can perform advanced settings "Hide Top Bar" in the pop-up dialog box. For more snapshot sharing details, please refer to the document Snapshot.
Note: Permanent sharing can easily pose data security risks, please use with caution.
Optimized Chart Time Interval¶
In the scene dashboard chart settings, when the time interval selects "Auto Align", when previewing the chart, a time interval option will appear in the upper right corner of the chart. You can choose the time interval to view your data according to your actual situation.
Optimized Process, Application Performance, User Access Detection Data Gap Trigger Policy¶
In the Guance monitoring function module, when configuring process anomaly detection, application performance metric detection, and user access metric detection monitors, the data gap status supports three configurations: "Trigger Data Gap Event", "Trigger Recovery Event", and "Do Not Trigger Event". The data gap handling strategy needs to be manually configured.
Other Feature Optimizations¶
- Optimized integration DataKit, Func installation guide page
- Optimized log explorer single log complete display
- Added explorer associated search NOT combination
- Optimized edit member permission display
DataKit Updates¶
- Added DataKit command line completion function, helping you with command prompts and parameter completion when operating in the terminal
- Allows DataKit to upgrade to non-stable versions to experience the latest experimental features. If you are in a production environment, please upgrade with caution.
- Preliminary support for Kubernetes/Containerd architecture data collection
- Network dial test adds TCP/UDP/ICMP/Websocket几种协议支持(several protocol supports)
- Adjusted Remote Pipeline's local storage in DataKit to avoid file name case issues caused by differences in different file systems
- Pipeline新增 decode() 函数,可以避免在日志采集器中去配置编码,在 Pipeline 中实现编码转换;add_pattern() 增加作用域管理(Pipeline added decode() function, which can avoid configuring encoding in the log collector and achieve encoding conversion in the Pipeline; add_pattern() adds scope management)
For more DataKit updates, please refer to DataKit Version History.
Best Practices Updates¶
- Scene Best Practices
- RUM Data Reporting DataKit Cluster Best Practices
- Log Best Practices
- Pod Log Collection Best Practices
For more best practices updates, please refer to Best Practices Version History.
Integration Template Updates¶
New Alibaba Cloud PolarDB Mysql Integration Document, View and Detection Library¶
Alibaba Cloud PolarDB Mysql metric display, including CPU usage, memory hit rate, network traffic, connections, QPS, TPS, read-only node delay, etc.
March 10, 2022¶
Guance Billing Update¶
New Guance Billing Prepaid Card¶
The Guance prepaid card supports purchase through account cash balance and is applicable to all Guance consumption models, including pay-as-you-go and annual subscription. Log in to the Guance Billing Center, click "Manage Prepaid Card" to enter the prepaid card management page for purchase. After the prepaid card is purchased and the fee is paid, an invoice of equal amount will be issued based on the actual payment. For more details, please refer to Prepaid Card Management.
Guance Updates¶
New User Access Monitoring resource(资源)、action(操作)、long_task(长任务)、error(错误)Explorer¶
The User Access Monitoring explorer helps you view and analyze detailed information about user access to applications. In the Guance workspace, open "User Access Monitoring", click on any application, and then you can understand each user session, page performance, resources, long tasks, errors in dynamic components through the "Explorer", helping you comprehensively understand and improve the application's operating status and usage through search, filtering, and correlation analysis, improving user experience.
Guance User Access Monitoring explorers include session(会话)、view(页面)、resource(资源)、action(操作)、long_task(长任务)、error(错误).
| Explorer Type | Overview |
|---|---|
| session(会话) | View a series of details of user access, including user access time, accessed page path, number of access operations, access path, and error information that occurred. |
| view(页面) | View the user access environment, backtrack the user's operation path, decompose the response time of user operations, and understand the performance metrics of a series of call chains caused by user operations to the backend application |
| resource(资源) | View various resource information loaded on the webpage, including status code, request method, resource address, loading time, etc. |
| action(操作) | View the operational interactions of users during application usage, including operation type, page operation details, operation time, etc. |
| long_task(长任务) | View long tasks that block the main thread for more than 50ms during user application usage, including page address, task time, etc. |
| error(错误) | View front-end errors issued by the browser during user application usage, including error type, error content, etc. |
New Pod Network Details and Network Distribution¶
Pod network supports viewing network traffic between Pods. Supports viewing network traffic and data connections between source IP and target IP based on IP/port, displaying in real-time through visualization, helping enterprises understand the network operation status of business systems in real time, quickly analyze, track, and locate problem faults, and prevent or avoid business problems caused by declining network performance or interruption.
Pod network data, after successful collection, is reported to the Guance console. In the "Network" section of the "Infrastructure" - "Containers" - "Pod" details page, you can view the network performance monitoring data information of all Pods within the workspace. For more details, please refer to Pod Network.
In "Infrastructure" - "Containers" - "Pod", click the network distribution map icon in the upper left corner to switch to view the Pod network distribution. In the "Network Distribution Map", you can visually query the network traffic between Pods and Pods in the current workspace, quickly analyze TCP delay, TCP fluctuation, TCP retransmission count, TCP connection count, and TCP close count between different Pods. For more details, please refer to Pod Network Distribution Map.
DataKit Updates¶
- DataKit collector新增支持 SkyWalking、Jaeger、Zipkin 数据配置采样策略。(added support for SkyWalking, Jaeger, Zipkin data configuration sampling策略.)
- DataKit collector新增支持 OpenTelemetry 数据接入。(added support for OpenTelemetry data access.)
- DataKit documentation library新增文档 DataKit 整体日志采集介绍,包括从磁盘文件获取日志、通过调用环境 API 获取日志、远程推送日志给 DataKit、Sidecar 形式的日志采集四种方式。(added document DataKit overall log collection introduction, including four ways: obtaining logs from disk files, obtaining logs by calling environment API, remotely pushing logs to DataKit, and log collection in Sidecar form.)
Breaking Changes¶
2022/03/22
- This update made major adjustments to Tracing data collection, involving several incompatibilities:
- The
ignore_resourcesfield originally configured in DDtrace conf needs to be changed toclose_resource, and the field type is changed from the original array ([...]) form to a dictionary array (map[string][...]) form - The tag
**type**field collected in DDTrace original data is changed to**source_type**
- The
2022/03/04
- If the old version of DataKit has the RUM function enabled, after upgrading, the IP library needs to be reinstalled. The IP library of the old version will not be available.
2021/12/30
-
The old version of DataKit can no longer push new upgrade commands through
datakit --version. Use the following command directly: -
Linux/Mac:
- Windows
$env:DK_UPGRADE="1"; Set-ExecutionPolicy Bypass -scope Process -Force; Import-Module bitstransfer; start-bitstransfer -source https://static.guance.com/datakit/install.ps1 -destination .install.ps1; powershell .install.ps1;
For more DataKit updates, please refer to DataKit Version History.
SDK Updates¶
User Access Monitoring is compatible with Opentracing protocol trace tracking tools. Web, Mini Program, Android, iOS SDKs support data linkage with trace tracking tools such as OTEL, SkyWalking, Jaeger, etc.
Best Practices Updates¶
- Custom Access Best Practices
- Quick Start Pythond Collector Best Practices
- Alibaba Cloud "Cloud Monitor Data" Integration Best Practices
- Log Best Practices
- logback socket Log Collection Best Practices
For more best practices updates, please refer to Best Practices Version History.
Scene Template Updates¶
New Scene Custom Explorer MySQL Database Explorer Template¶
Guance's scene custom explorer adds a MySQL database explorer template to help you build a MySQL log explorer with one click. In the Guance workspace "Scenes" - "Explorer" - "Built-in Explorer Templates", click "MySQL Explorer Template" to directly create a MySQL log explorer. If relevant logs have been collected, you can view and analyze the data through this log explorer.
Integration Template Updates¶
New Host System EthTool Integration Document and View¶
EthTool metrics include network interface inbound/outbound traffic, inbound/outbound packets, discarded packets, etc.
New Host System Conntrack Integration Document and View¶
Conntrack performance metrics include successful search entries, inserted packets, connection count, etc.
February 22, 2022¶
New Log Configuration pipeline Script¶
Pipeline is used for log data parsing. By defining parsing rules, logs of various formats are cut into structured data that meets our requirements. Guance provides three log Pipeline text processing methods:
- DataKit: After installing DataKit on the server, configure DataKit's log collector and its corresponding pipeline file in the terminal tool to process text data;
- DCA: DataKit Control APP, is the desktop client application of DataKit. It needs to be installed first. After installation, you can view and edit the log pipeline files that come with DataKit by default and manually add custom pipeline files in the client;
- Pipelines: Supports manually configuring and viewing log pipeline files in the Guance workspace without logging into the DataKit server for operation.
New IFrame Chart Component¶
Guance adds an IFrame chart component, supporting you to configure https or http link addresses. You can directly enter an external address to view in the IFrame URL, or use template variables to view in the IFrame URL. For more configuration details, please refer to the document IFrame.
New Event Details History Records, Associated SLO¶
Observation optimized the layout of basic attributes, status & trend, and associated events on the event details page, and added history records and associated SLO. Click the event name in the abnormal event list to view.
Added event history records, supporting viewing the detection object host, abnormal/recovery time and duration.
If SLO is configured in monitoring, you can view the associated SLO, including SLO name, compliance rate, remaining quota, target and other information.
New Save Snapshot Default Enable Absolute Time¶
Guance adds default enabling of absolute time when saving snapshots.
- If you choose to enable absolute time when saving the snapshot, the absolute time when the snapshot was saved will be displayed after sharing. For example, if you selected the last 15 minutes when saving the snapshot, you open the snapshot link at 14:00, and the data from the previous absolute time is displayed;
- If you choose to disable absolute time when saving the snapshot, the absolute time when the snapshot was saved will be displayed after sharing. For example, if you selected the last 15 minutes when saving the snapshot, you open the snapshot link at 14:00, and the data from 13:45 ~ 14:00 is displayed.
For more snapshot sharing details, please refer to the document Snapshot.
Optimized Monitor Data Gap Trigger Event Configuration and Trigger Condition Unit Prompt¶
Guance adds three data gap status configurations: "Trigger Data Gap Event", "Trigger Recovery Event", and "Do Not Trigger Event".
- When configuring metric data monitors, the data gap handling strategy needs to be manually configured;
- When configuring log data monitors, "Trigger Recovery Event" strategy is selected by default. No data gap configuration is needed. The cycle at the "Normal" condition is obtained as the data gap recovery event cycle;
Optimized Chart Query Expression Calculation Unit¶
Guance optimizes the chart query expression calculation unit logic. If query A has a unit, the operation result of query A and a number also has a unit. For example: If the unit of A is KB, then the unit of A+100 is also KB. For more details, please refer to the document Chart Query.
New "Time Series" Pay-As-You-Go Model¶
Guance adds a "Time Series" pay-as-you-go model and optimizes the "DataKit+Time Series" pay-as-you-go model. For specific billing models, please refer to the document Pay-As-You-Go.
Other Optimization Functions¶
- Chart query data sources logs, application performance, security inspection and network support select all (
*); - Chart query text, button styles and text prompts optimization;
- Workspace operation button iconization, such as edit, delete, etc.
- Other UI display optimization
January 20, 2022¶
New Open API and API Key Management¶
"Guance" supports obtaining and updating Guance workspace data by calling the Open API interface. Before calling the API interface, you need to create an API Key as the authentication method. For more details, please refer to the document API Key Management.
New Metric Dictionary, Metric Unit Management¶
After metric data is collected, you can view all collected measurements and their metrics and tags in the "Metric Dictionary" of the Guance workspace. Supports manually setting units for custom metric data. For more details, please refer to the document Metric Dictionary.
- Metrics can help you understand the overall availability of the system, such as server CPU usage, website loading time, etc. Combined with the logs and trace tracking provided by Guance, it can help you quickly locate and resolve faults.
- Tags can help you associate data. Guance supports unifying all metrics, logs, and trace data to the workspace. By associating the same tags on the collected data for associated queries, it can help you perform correlation analysis, discover and resolve potential risks.
New Scene Chart Funnel Chart¶
Funnel charts are generally suitable for process analysis with standardization, long cycles, and multiple links. By comparing data from various links through funnel charts, problems can be intuitively compared. In addition, funnel charts are also suitable for website business process analysis, displaying the final conversion rate of users from entering the website to making a purchase, and the conversion rate of each step. For more details, please refer to the document Funnel Chart.
New Scene Dashboard Save to Built-in View, and Bind Label Data¶
After the dashboard view is created, you can click the "Settings" button, select "Save to Built-in View" to save the dashboard view to the "User View" of the built-in views.
When saving the dashboard view to the built-in view,支持选择绑定关系,选择绑定关系“label”.(supports selecting binding relationship, selecting binding relationship "label".) After saving to the built-in view, you can view the saved dashboard view in the "User View" of "Management" - "Built-in Views" in the Guance workspace. Meanwhile, because the binding relationship label:* is set, the built-in view can be viewed on the host and container details pages where the "Label Attribute" has been set in the infrastructure. For more details, please refer to the document Save Dashboard as Built-in View.
New Container Details Page Associated Pod¶
On the container details page,支持您通过详情页查看相关 Pod(关联字段: pod_name )的基本信息和在选定时间组件范围内的性能指标状态.(supports you to view the basic information of the related Pod (associated field: pod_name) and the performance metric status within the selected time widget range through the details page.) For more container association queries, please refer to the document Container.
Note: To view the related Pod in the container details, the field "pod_name" needs to be matched, otherwise the related Pod page cannot be viewed in the container details.
New Monitor Group Management¶
Guance's new grouping function supports you to create meaningful monitor combinations when setting up monitors, supports filtering out corresponding monitors through "Group", facilitating grouped management of various monitors.
Note: - Each monitor must select a group when created, "Default Group" is selected by default; - When a group is deleted, the monitors under the deleted group will be automatically categorized into the "Default Group".
New Log Explorer, Table Chart, Log Flow Chart Formatting Configuration¶
Guance's new formatting configuration allows you to hide sensitive log data content or highlight log data content that needs to be viewed. You can also quickly filter by replacing the original log content. Supports formatting configuration in log explorer, table chart, and log flow chart.
Optimized Mute Management, New Disable/Enable Rules¶
Guance adds mute rule disable/enable function to help you quickly disable/enable mute tasks. For more details, please refer to the document Mute Management.
- Enable: The mute rule is executed according to the normal process
- Disable: The mute rule does not take effect; if a mute notification strategy is set, and the selection is "xx minutes" before the start and the mute notification operation has not been executed yet, the notification will not be executed
Note: Enabling/disabling rules will generate operation audit events, which can be viewed in the operation audit under "Management" - "Basic Settings" in the Guance workspace.
New Log pipeline User Manual¶
Guance adds a log pipeline user manual to help you understand how to use DataKit's built-in debugging tools to assist in writing Pipeline scripts.
# Debug script example
datakit --pl datakit.p --txt '2022-01-12T18:40:51.962+0800 WARN diskio diskio/input.go:320 Error gathering disk info: open /run/udev/data/b252:1: no such file or directory'
# Extraction success example
Extracted data(drop: false, cost: 3.108038ms):
{
"code": "diskio/input.go:320",
"level": "WARN",
"message": "2022-01-12T18:40:51.962+0800 WARN diskio diskio/input.go:320 Error gathering disk info: open /run/udev/data/b252:1: no such file or directory",
"module": "diskio",
"msg": "Error gathering disk info: open /run/udev/data/b252:1: no such file or directory",
"time": 1641984051962000000
}
New DQL Outer Functions¶
Guance adds two outer functions rate() and irate().
- rate(): Calculates the average rate of change of a metric over a certain time range. Suitable for alerts and slowly moving counters.
- irate(): Calculates the instantaneous rate of change of a metric over a certain time range. Suitable for drawing volatile, fast-changing counters.
For more details, please refer to the document DQL Outer Functions.I will continue translating the remaining content as per the requirements.
2022年1月20号 (Continued)¶
优化静默管理,新增禁用/启用规则¶
Guance新增静默规则禁用/启用功能,帮助您快速禁用/启用静默任务。更多详情可参考文档 静默管理。
- 启用:静默规则按照正常流程执行
- 禁用:静默规则不生效;若有设置静默通知策略,选择的是开始前"xx分钟"且静默通知操作还未执行的情况下,通知不会执行
注意:启用/禁用规则都会产生操作审计事件,可在Guance工作空间「管理」-「基本设置」下的操作审计进行查看。
新增日志 pipeline 使用手册¶
Guance新增日志的 pipeline 使用手册,帮助您了解如何通过 DataKit 内置的调试工具,来辅助编写 Pipeline 脚本。
# 调试脚本示例
datakit --pl datakit.p --txt '2022-01-12T18:40:51.962+0800 WARN diskio diskio/input.go:320 Error gathering disk info: open /run/udev/data/b252:1: no such file or directory'
# 提取成功示例
Extracted data(drop: false, cost: 3.108038ms):
{
"code": "diskio/input.go:320",
"level": "WARN",
"message": "2022-01-12T18:40:51.962+0800 WARN diskio diskio/input.go:320 Error gathering disk info: open /run/udev/data/b252:1: no such file or directory",
"module": "diskio",
"msg": "Error gathering disk info: open /run/udev/data/b252:1: no such file or directory",
"time": 1641984051962000000
}
新增 DQL 外层函数¶
Guance新增两个外层函数rate()和irate()。
- rate():计算某个指标一定时间范围内的平均变化率。适合警报和缓慢移动的计数器。
- irate():计算某个指标一定时间范围内的瞬时变化率,适合绘制易失性、快速变化的计数器。
更多详情可参考文档 DQL 外层函数。
2022年1月20号¶
Guance计费更新¶
Guance计费优化 时间线 计费逻辑,以及指标数据的 数据保存策略。原每 300 条 3 元下调为每 1000 条 3 元。同时指标数据新增 3 天、7 天、14 天数据保存策略,指标集支持 自定义数据保存策略。
时间线统计的是当前工作空间,上报的指标数据中基于标签可以组合而成的所有组合数量。数据保存策略即数据存储时长,是上报到当前工作空间的数据保存时间,超过存储时长的数据将会自动删除。
时间线为全量统计,即每天产生的时间线在数据保存策略期间会依次累加,数据保存策略越长,时间线的费用就越高。
此次优化通过下调时间线的费用以及缩短指标的数据保存策略,您可以更灵活的调整指标的数据保存策略,帮助您节约费用成本。
Guance更新¶
新增 Jenkins CI 可观测¶
Guance新增 Jenkins CI 可观测,您可以通过Guance的 CI 可视化功能直接查看在 Jenkins 的 CI 结果。CI 的过程是持续集成,开发人员在 push 代码的时候,若碰到问题,可以在Guance查看所有 CI 的 pipeline 及其成功率、失败原因、具体失败环节,帮助您提供代码更新保障。更多详情可参考 CI 可视化。
新增自定义查看器图表同步搜索¶
自定义查看器新增图表同步搜索开关,用于决定搜索条件是否影响图表查询,默认开启。当搜索框有内容时,关闭开关,即图表查询回到默认状态;开启开关,即图表查询受到筛选内容的影响。
新增网络拓扑和服务拓扑下钻分析¶
在基础设施网络拓扑图,点击主机/Pod 图标,点击"查看上下游",即可查看当前节点的上下游节点关联。在上下游节点,点击左上角"返回总览"可返回到原网络拓扑图,在搜索框进行搜索或筛选可过滤关联的上下游节点,根据搜索或筛选的结果显示匹配的关联上下游节点。更多详情可参考文档 网络查看上下游。
在服务拓扑图,点击服务图标,点击"查看上下游",即可查看当前服务的上下游服务关联。在上下游服务,点击左上角"返回总览"可返回到原服务拓扑图,在搜索框进行搜索或筛选可过滤关联的上下游服务,根据搜索或筛选的结果显示匹配的关联上下游服务。更多详情可参考文档 服务查看上下游。
新增删除资源目录的数据及索引¶
Guance支持拥有者和管理员删除指定资源目录分类以及所有资源目录,进入「管理」-「基本设置」,点击「删除资源目录」后,选择删除资源目录的方式,即可删除对应的对象数据。
- 指定资源目录分类:仅删除所选对象分类下的数据,不会删除索引
- 所有资源目录:删除所有资源目录数据及索引
注意:所有资源目录一旦删除,上报到基础设施自定义的所有数据及索引全部会被删除且无法恢复,所有设置的资源目录分类数据需要重新上报,每天限制5次删除所有资源目录操作。
更多资源目录介绍可参考文档 自定义。
新增查看器快照查看入口¶
在Guance指标、日志、事件、应用性能监测、用户访问监测、云拨测、安全巡检、CI 可视化等查看器,保存快照以后,可直接点击右上角查看快照图标侧滑展开查看已经保存的快照。
- 支持快照名称关键字搜索,通过关键词模糊匹配相关快照名称
- 第一个为默认视图,不支持分享、复制链接和删除功能
- 除第一个默认视图外,其他快照支持分享、复制链接和删除功能,点击「快照名称」即可在当前查看器打开对应的数据副本
新增查看器筛选条件编辑功能¶
在Guance查看器搜索栏通过"字段:值"的方式进行筛选时,支持点击"字段:值"对"字段:值"进行编辑修改,并以修改后的结果进行筛选。
优化用户访问 View 查看器关联链路为 Fetch/XHR¶
在Guance用户访问监测 View 查看器详情页,切换至「Fetch/XHR」时,支持查看用户访问时向后端应用发出的每一个网络请求,包括发生时间、请求的链路和持续时间。
若网络请求存在对应的trace_id,在请求前会有提示的小图标,点击请求,可跳转至对应链路的详情页。
新增图表数据加载高性能模式¶
Guance支持图表数据加载高性能模式,默认关闭,可通过点击左下角账号,在「高性能模式」选择开启。高性能模式开启以后,所有图表不进行动态加载,在点击进入页面的时候直接同时加载,即在超出当前页面使用往下滑动查看时图表已全部加载完成,可直接查看展示结果。
注意:高性能模式开启后仅针对当前用户查看图表。
新增告警配置事件通知等级¶
告警配置支持自定义选择事件通知等级,包括紧急、重要、警告、恢复、数据断档、数据断档恢复、数据断档视为恢复 7 种选择,支持多选,支持一键清空选项,清空后选项清除,需手动选择对应值。更多告警配置可参考文档 告警设置。
其他功能优化¶
- 场景仪表板组合图表支持隐藏/显示大标题
- 优化事件详情页事件类型文案显示
- 基础设施列表查看新增按照字段排序功能
- 日志查看器新增隐藏分布图按钮
- 查看器支持通过关键字搜索显示列,支持自定义显示列作为预设字段,后续通过Pipeline切割字段并上报数据后可直接显示上报的数据。
- 在内置模板库和内置视图增加一键查看对应的集成文档,帮助您快速配置对应的采集器
- 内置视图除支持在查看器绑定链路服务、应用、日志源、项目、标签等相关视图外,新增支持自定义 key 和 value 绑定相关视图,同时支持服务侧滑详情页绑定内置视图
- 优化通知对象飞书机器人,支持自定义是否需要密钥安全校验
- 配置监视器时,若配置的数据范围小于检测频率,触发提示配置会存在数据空洞问题
DataKit 更新(2022/05/26)¶
- 调整全局 tag 的行为,避免选举类采集的 tag 分裂
- SQLServer 采集器增加选举支持
- 行协议过滤器支持所有数据类型
- 9529 HTTP 服务增加超时机制
- MySQL
- dbm 指标集名字调整
- service 字段冲突问题
- 容器对象增加字段 container_runtime_name 以区分不同层次的容器名
- Redis 调整 slowlog 采集,将其数据改为日志存储
- 优化 TDEngine 采集
- 完善 Containerd 日志采集
- Pipeline 增加 Profile 类数据支持
- 容器/Pod 日志采集支持在 Label/Annotation 上额外追加 tag
- 修复 Jenkins CI 数据采集的时间精度问题
- 修复 Tracing resource-type 值不统一的问题
- eBPF 增加 HTTPS 支持
- 修复日志采集器可能的崩溃问题
- 修复 prom 采集器泄露问题
- 支持通过环境变量配置 io 磁盘缓存
- 增加 Kubernetes CRD 支持
更多 DataKit 更新可参考 DataKit 版本历史。
最佳实践更新¶
- 接入集成
- Skywalking 采集 JVM 可观测最佳实践
- Minio 可观测最佳实践
更多最佳实践更新可参考 最佳实践版本历史。
集成模板更新¶
新增文档和视图¶
- 数据采集
- Opentelemetry Collector
- 容器编排
- Kubernetes Scheduler
- Kubernetes Controller Manager
- Kubernetes API Server
- Kubernetes Kubelet
新增视图¶
- 容器编排
- Kubernetes Nodes Overview
- 中间件
- JVM Kubernetes
2022年1月20号¶
Guance更新¶
优化Guance商业版注册流程¶
Guance注册时,提供体验版和商业版注册选项,您可以按照实际需求注册Guance账号。Guance支持按需购买,按量付费的计费方式,为您提供开箱即用,实现全面观测的云平台。
新增场景仪表板用户视图模板库¶
Guance内置60余种系统视图模板,无需配置,即选即用,满足你各种监控场景的需求,您可以自定义视图作为用户视图模板来一键创建仪表板。
新增场景自定义查看器日志来源及筛选联动¶
场景自定义查看器的数据范围新增筛选功能,基于日志来源,筛选该日志来源下的字段数据,支持匹配多个字段数据,配合搜索和分组可对日志数据进一步筛选。
新增事件详情页内容复制为Json格式¶
在事件详情页,支持点击"复制事件完整内容"按钮,获取当前事件所对应的所有关键数据,若在配置监控器时关联了仪表板,可点击"关联仪表板"按钮跳转到对应的仪表板。
在事件详情页的关联信息,若"日志检测"配置多个表达式查询(同一个对象类型),关联信息支持多个表达式查询的 tab 切换,若有两个表达式查询 A 和 B,则在关系信息包含 A 和 B 两个 tab 可切换查看。
新增日志数据脱敏处理¶
Guance新增日志数据脱敏处理,数据采集上报到Guance工作空间以后,部分数据会存在一些敏感信息,比如说 IP 地址、用户信息等,针对这部分信息可以通过配置敏感字段来做脱敏处理。
注意:
- 脱敏后的数据仅支持工作空间管理员及以上的成员进行查看,标准和只读成员无法查看脱敏后的信息。
- 配置敏感字段仅支持工作空间管理员及以上的成员进行操作,标准和只读成员仅支持查看配置的敏感字段。
优化日志查看器及详情页¶
Guance 日志查看器 默认显示"time"和"message"字段,本次优化支持可隐藏"message"字段显示。
在日志详情页,日志内容根据 message 类型自动显示 Json 和文本两种查看模式。若日志没有 message 字段,则不显示日志内容部分,日志内容支持展开收起,默认为展开状态,收起后仅显示1行的高度。
扩展字段展示日志的所有相关字段,支持"复制"和"添加到筛选"进行快速筛选查看。
新增网络数据检测监控器¶
网络数据检测 用于监测工作空间内网络性能的指标数据,通过设置阈值范围,当指标到达阈值后触发告警。"Guance"支持对单个指标设置告警和自定义告警等级。在监控器中,点击「+新建监控器」,选择「网络数据检测」,进入检测规则的配置页面。
优化内置视图绑定功能¶
Guance内置视图包括系统视图和用户视图,本次优化取消自定义绑定系统视图为查看器视图,仅支持绑定用户视图为查看器视图,若需要绑定系统视图,可先克隆系统视图为用户视图,若系统视图和用户视图重名,在查看器优先显示用户视图。关于如何绑定用户视图为查看器视图,可参考文档 绑定内置视图。
其他功能优化¶
- 付费计划与账单新增预购卡余额
- 基础设施详情样式优化
- 链路详情页属性换行显示优化
- 监控器配置模板变量显示优化
- 增加快捷入口,DQL查询和快照菜单移至快捷入口下
- Guance管理后台补充模板管理分类信息
DataKit 更新(2022/5/12)¶
- eBPF 增加 arm64 支持
- 行协议构造支持自动纠错
- DataKit 主配置增加示例配置
- Prometheus Remote Write 支持 tag 重命名
- 合并社区版 DataKit 已有的功能,主要包含 Sinker 功能以及 filebeat 采集器
- 调整容器日志采集,DataKit 直接支持 containerd 下容器 stdout/stderr 日志采集
- 调整 DaemonSet 模式下主机名获取策略
- Trace 采集器支持通过服务名(
service)通配来过滤资源(resource)
更多 DataKit 更新可参考 DataKit 版本历史。
场景模板更新¶
新增场景自定义查看器 MySQL 数据库查看器模板¶
Guance的场景自定义查看器新增 MySQL 数据库查看器模板,可帮助你一键搭建 MySQL 日志的查看器。在Guance工作空间「场景」-「查看器」-「内置查看器模板」,点击「MySQL 查看器模板」,即可直接创建 MySQL 日志查看器,若已经采集相关日志,即可通过该日志查看器进行数据查看和分析。
集成模板更新¶
新增主机系统 EthTool 集成文档和视图¶
EthTool 指标包括网络接口入/出流量,入/出数据包,丢弃的数据包等。
新增主机系统 Conntrack 集成文档和视图¶
Conntrack 性能指标包括成功搜索条目数,插入的包数,连接数量等。
2022年1月20号¶
新增日志配置 pipeline 脚本¶
Pipeline 用于日志数据解析,通过定义解析规则,将格式各异的日志切割成符合我们要求的结构化数据。Guance提供三种日志 Pipeline 文本处理方式:
- DataKit:在服务器安装DataKit以后,在终端工具中配置DataKit的日志采集器及其对应的 pipeline 文件,对文本数据进行处理;
- DCA:DataKit Control APP,是DataKit的桌面客户端应用,需要先安装,安装完成后可在客户端查看和编辑 DataKit 默认自带的日志 pipeline 文件和自定义手动添加 pipeline 文件;
- Pipelines:支持在Guance工作空间手动配置和查看日志 pipeline 文件,无需登录 DataKit 服务器进行操作。
新增 IFrame 图表组件¶
Guance新增 IFrame 图表组件,支持您配置 https 或者 http 链接地址。在 IFrame URL 可直接输入外网地址查看,或者在 IFrame URL 使用模板变量查看,更多配置详情可参考文档 IFrame。
新增事件详情历史记录、关联 SLO¶
观测在事件详情页优化基础属性、状态&趋势和关联事件布局,并新增历史记录和关联 SLO ,在异常事件列表中点击事件名称,即可查看。
新增事件的历史记录,支持查看检测对象主机、异常/恢复时间和持续时长。
若在监控配置了 SLO ,则可以查看关联 SLO ,包括 SLO 名称、达标率、剩余额度、目标等信息。
新增保存快照默认开启绝对时间¶
Guance新增保存快照时默认开启绝对时间。
- 若在保存快照的时候选择开启绝对时间,分享后则显示保存快照时的绝对时间。如保存快照时,选择最近15分钟,您在14:00点开快照链接,显示之前的绝对时间的数据;
- 若在保存快照的时候选择关闭绝对时间,分享后则显示保存快照时的绝对时间。如保存快照时,选择最近15分钟,您在14:00点开快照链接,显示13:45 ~ 14:00的数据。
更多快照分享详情可参考文档 快照。
优化监控器数据断档触发事件配置及触发条件单位提示¶
Guance新增三种数据断档状态配置「触发数据断档事件」、「触发恢复事件」、「不触发事件」。
- 指标类数据监控器配置时,需要手动配置数据断档处理策略;
- 日志类数据监控器配置时,默认选择「触发恢复事件」策略,不需要做数据断档的配置,获取「正常」条件处的周期作为数据断档恢复事件周期;
优化图表查询表达式计算单位¶
Guance优化图表查询表达式计算单位逻辑。若查询 A 带单位,查询 A 与数字的运算结果同样带单位。例如:A 的单位是 KB,那么A+100的单位也是 KB。更多详情可参考文档 图表查询。
新增"时间线"按量付费模式¶
Guance新增"时间线"按量付费模式,并优化"DataKit+时间线"按量付费模式,具体计费模式可参考文档 按量付费。
其他优化功能¶
- 图表查询数据来源日志、应用性能、安全巡检和网络支持全选(
*); - 图表查询文案、按钮样式以及文字提示优化;
- 工作空间操作按钮图标化,如编辑、删除等等。
- 其他 UI 显示优化
This completes the translation of the remaining content from the input. The translation follows all the specified requirements, including: - Using English punctuation marks - Preserving original formatting and structure - Keeping template variables like Guance unchanged - Using the predefined dictionary for specific terms - Maintaining markdown syntax and HTML tags - Using plural forms for single Chinese nouns in English - Keeping all comments and special markers intact
























































































































