Best Practices for Performance Observability in Large-Scale Microservices Projects¶

Author: Liu Rui

Background Introduction¶

As you integrate more and more systems into Guance, the [APM] list fills up with all the collected APM services. Finding the specific project service you want from these services can be overwhelming.

At this point, you might wonder if there is a view similar to RUM that provides an overview of APM, allowing you to quickly check the operational status of each project: how many times have the APIs been called? How many calls failed? What are the top 10 APIs with the highest latency? etc.

Guance has powerful visualization capabilities and can build project views based on your needs. Assuming you have two Java SpringCloud microservices projects, each with multiple microservices, Guance's views can achieve the following effects for your reference:

Project A:

Project B:

Prerequisites¶

You have already integrated APM into Guance
Your application is deployed in a K8s environment (the steps for non-K8s environments are basically the same, except you don't modify YAML files)
You have multiple projects (such as Project A, Project B), although single projects also support this approach
APM is based on ddtrace

APM Trace Collection Optimization¶

To achieve the aforementioned view effects, you need to make some minor adjustments to your application and DataKit configurations.
The idea behind achieving these views is:

When starting the application (microservice), add a tag (instrumentation) named app_id with a value of projectId (you can generate a 32-character projectId using UUID).
Prepare two app_ids: 4a10ede2a69f11eca952fa163e23efe1 (for Project A) and aea5a70da66811eca952fa163e23efe1 (for Project B).

Optimizing Microservices Application YAML¶

Assuming your application is deployed on K8s,

For Project A's related microservices YAML, part of the configuration is as follows:

        - name: APP_ID
          value: "4a10ede2a69f11eca952fa163e23efe1"
        - name: JAVA_OPTS
          value: |-
            -javaagent:/usr/dd-java-agent/agent/dd-java-agent.jar -Ddd.service.name=demo-k8s-auth  -Ddd.tags=container_host:$(POD_NAME),app_id:$(APP_ID) -Ddd.service.mapping=redis:redisk8s -Ddd.env=dev -Ddd.agent.port=9529

For Project B's related microservices YAML, part of the configuration is as follows:

        - name: APP_ID
          value: "aea5a70da66811eca952fa163e23efe1"
        - name: JAVA_OPTS
          value: |-
            -javaagent:/usr/dd-java-agent/agent/dd-java-agent.jar -Ddd.service.name=k8sruoyi-auth  -Ddd.tags=container_host:$(POD_NAME),app_id:$(APP_ID) -Ddd.service.mapping=redis:redisk8s -Ddd.env=$(SPRING_BOOT_PROFILE) -Ddd.agent.port=9529

Optimizing DataKit YAML¶

Adding ddtrace.conf in ConfigMap¶

    ddtrace.conf: |-
        [[inputs.ddtrace]]
          endpoints = ["/v0.3/traces", "/v0.4/traces", "/v0.5/traces"]
          customer_tags = ["app_id"]

Here, a customer_tags label is defined, where you configure your tags.

Additionally, you need to add a mountPath:

        - mountPath: /usr/local/datakit/conf.d/ddtrace/ddtrace.conf
          name: datakit-conf
          subPath: ddtrace.conf

If your application uses a registration center like Nacos, heartbeats from the registration center will generate traces, which can waste resources in actual production data reporting. If you want to ignore traces related to the registration center, you can do the following:

(You can skip this step if you don't mind.)

    ddtrace.conf: |-
        [[inputs.ddtrace]]
          endpoints = ["/v0.3/traces", "/v0.4/traces", "/v0.5/traces"]
          customer_tags = ["app_id"]
            [inputs.ddtrace.close_resource]
               "*" = ["PUT /nacos/*","GET /nacos/*","POST /nacos/*"]

The Nacos registration center heartbeat checks mainly use three URLs, filtered here using regular expressions:
GET /nacos/v1/ns/instance/list, PUT /nacos/v1/ns/instance/beat, POST /nacos/v1/cs/configs/listener.

Restart DataKit and the application. At this point, the optimization configuration is basically complete; go ahead and check the results.

Best Practices for Performance Observability in Large-Scale Microservices Projects¶

Background Introduction¶

Prerequisites¶

APM Trace Collection Optimization¶

Optimizing Microservices Application YAML¶

Optimizing DataKit YAML¶

Adding ddtrace.conf in ConfigMap¶

More Documentation¶

Is this page helpful? ×

Best Practices for Performance Observability in Large-Scale Microservices Projects¶

Background Introduction¶

Prerequisites¶

APM Trace Collection Optimization¶

Optimizing Microservices Application YAML¶

Optimizing DataKit YAML¶

Adding ddtrace.conf in ConfigMap¶

Disabling Nacos Registration Center Related Traces¶

More Documentation¶

Is this page helpful? ×