tencent cloud

Tencent Cloud Observability Platform

Release Notes and Announcements
Release Notes
Product Introduction
Overview
Strengths
Basic Features
Basic Concepts
Use Cases
Use Limits
Purchase Guide
Tencent Cloud Product Monitoring
Application Performance Management
Mobile App Performance Monitoring
Real User Monitoring
Cloud Automated Testing
Prometheus Monitoring
Grafana
EventBridge
PTS
Quick Start
Monitoring Overview
Instance Group
Tencent Cloud Product Monitoring
Application Performance Management
Real User Monitoring
Cloud Automated Testing
Performance Testing Service
Prometheus Getting Started
Grafana
Dashboard Creation
EventBridge
Alarm Service
Cloud Product Monitoring
Tencent Cloud Service Metrics
Operation Guide
CVM Agents
Cloud Product Monitoring Integration with Grafana
Troubleshooting
Practical Tutorial
Application Performance Management
Product Introduction
Access Guide
Operation Guide
Practical Tutorial
Parameter Information
FAQs
Mobile App Performance Monitoring
Overview
Operation Guide
Access Guide
Practical Tutorial
Tencent Cloud Real User Monitoring
Product Introduction
Operation Guide
Connection Guide
FAQs
Cloud Automated Testing
Product Introduction
Operation Guide
FAQs
Performance Testing Service
Overview
Operation Guide
Practice Tutorial
JavaScript API List
FAQs
Prometheus Monitoring
Product Introduction
Access Guide
Operation Guide
Practical Tutorial
Terraform
FAQs
Grafana
Product Introduction
Operation Guide
Guide on Grafana Common Features
FAQs
Dashboard
Overview
Operation Guide
Alarm Management
Console Operation Guide
Troubleshooting
FAQs
EventBridge
Product Introduction
Operation Guide
Practical Tutorial
FAQs
Report Management
FAQs
General
Alarm Service
Concepts
Monitoring Charts
CVM Agents
Dynamic Alarm Threshold
CM Connection to Grafana
Documentation Guide
Related Agreements
Application Performance Management Service Level Agreement
APM Privacy Policy
APM Data Processing And Security Agreement
RUM Service Level Agreement
Mobile Performance Monitoring Service Level Agreement
Cloud Automated Testing Service Level Agreement
Prometheus Service Level Agreement
TCMG Service Level Agreements
PTS Service Level Agreement
PTS Use Limits
Cloud Monitor Service Level Agreement
API Documentation
History
Introduction
API Category
Making API Requests
Monitoring Data Query APIs
Alarm APIs
Legacy Alert APIs
Notification Template APIs
TMP APIs
Grafana Service APIs
Event Center APIs
TencentCloud Managed Service for Prometheus APIs
Monitoring APIs
Data Types
Error Codes
Glossary

CVM Has No Monitoring Data

PDF
Focus Mode
Font Size
Last updated: 2024-01-27 17:35:59

Overview

The CVM instance must have the monitoring component agents installed to collect CVM metric data. If you cannot get the monitoring metric data, refer to this document to troubleshoot this problem. We recommend you reinstall the agents first as instructed in Installing CVM Agents and wait 3 minutes before checking again whether the monitoring data is restored. If an error occurs during the reinstallation or the monitoring data is not restored, troubleshoot the problem as detailed below.

Causes and Solutions

Cause
Solution
The agents are not installed/started
Troubleshoot by referring to Step 1
The reporting domains cannot be resolved
Troubleshoot by referring to Step 2
The agents failed to get the UUID
Troubleshoot by referring to Step 3
The CVM instance is shut down or being restarted
Troubleshoot by referring to Step 4
The CVM instance is under high load
Troubleshoot by referring to Step 5

Troubleshooting Procedure

Step 1. Check whether the agents have been installed or started

The troubleshooting procedures for Linux and Windows are different. You can refer to a procedure as needed.
Linux
Windows
1. Run the following command to check whether the agents have been installed successfully.
crontab -l |grep stargate
If the following message is displayed, the agents have been installed:

If not, install the agents as instructed in Installing CVM Agents.
2. Check whether the agents run properly. Run the following commands to check whether the agents run properly:
ps ax | grep sgagent
ps ax | grep barad_agent
If the output is as shown below, the agents run normally (note that the number of barad_agent processes is 3):

If there is no output or the number of processes is incorrect, the agents are abnormal. In this case, run the following commands as the root account to start the agents. If the messages stargate agent run succ and barad_agent run succ are displayed, the agents have been restarted successfully.
cd /usr/local/qcloud/stargate/admin
./restart.sh
cd /usr/local/qcloud/monitor/barad/admin
./stop.sh
./trystart.sh
Note:
After the agents are started, wait 3 minutes and then check whether there is monitoring data in the CVM console.
Run services.msc to check whether the agents are installed and started. If the status of QCloud BaradAgent Monitor or QCloud Stargate Manager is not Running, the service is not started. In this case, click the name of the corresponding service and start it.



Note:
If the agents are already started but there is still no monitoring data, you can proceed with the troubleshooting.
If the agents have not been installed, your CVM instance cannot be monitored and you will not receive a notification when the CVM instance runs abnormally, which can pose a high risk. For more information on the installations of agents, see Installing CVM Agents.

Step 2. Check reporting domains

The following 4 domains need to be resolved for the agents to run properly:
update2.agent.tencentyun.com
receiver.barad.tencentyun.com
custom.message.tencentyun.com
metadata.tencentyun.com
The procedures for checking and fixing the reporting domains are different for Linux and Windows. You can refer to a procedure as needed.
Linux
Windows
1. Check whether the reporting domains can be resolved properly. Run the following commands to check whether these 4 domains can be resolved properly:
ping -c 1 update2.agent.tencentyun.com
ping -c 1 receiver.barad.tencentyun.com
ping -c 1 custom.message.tencentyun.com
ping -c 1 metadata.tencentyun.com
In normal cases, these 4 domains can be resolved on the CVM instance. If unknown host is displayed, the domains fail to be resolved. You can proceed to the next step to fix it.
2. Fix DNS resolution. Tencent Cloud provides reliable private network DNS servers in different regions. We recommend you not overwrite the default DNS configurations. If you need to modify them, fix the resolution for the 4 domains above as follows:
1. If you use a self-built/third-party DNS service, we recommend you not add the private network DNS provided by Tencent Cloud in /etc/resolv.conf. For more information, see Private Network Access.
2. If you use a self-built DNS service, you can also add the 4 domains above to your DNS. The domain and IP mappings are as follows:
Domain Name
IP
update2.agent.tencentyun.com
169.254.0.15
receiver.barad.tencentyun.com
169.254.0.4
custom.message.tencentyun.com
169.254.0.5
metadata.tencentyun.com
169.254.10.10
3. If the two methods above cannot work, you can add the following configuration to the `/etc/hosts` file on the server:
169.254.0.15 update2.agent.tencentyun.com
169.254.0.4 receiver.barad.tencentyun.com
169.254.0.5 custom.message.tencentyun.com
169.254.10.10 metadata.tencentyun.com
Note:
After the domain resolution issue is fixed, check whether the domains can be resolved properly. If yes, wait 3 minutes and then go to the CVM console to confirm whether there is monitoring data.
1. Check whether the reporting domains can be resolved properly. Run the following commands to check whether these 4 domains can be resolved properly:
ping -n 1 update2.agent.tencentyun.com
ping -n 1 receiver.barad.tencentyun.com
ping -n 1 custom.message.tencentyun.com
ping -n 1 metadata.tencentyun.com
In normal cases, these 4 domains can be resolved on the CVM instance. If "host not found" is displayed, the domain resolution fails. In this way, you can fix the resolution as follows:
2. Fix DNS resolution.
Tencent Cloud provides reliable private network DNS servers in different regions. We recommend you not overwrite the default DNS configurations. If you need to modify them, fix the resolution for the 4 domains above as follows:
1. Log in to the Windows CVM instance.
2. On the operating system UI, open Control Panel > Network and Sharing Center > Change adapter settings.
3. Right-click Ethernet and select Properties to open the Ethernet Properties window.
4. In the Ethernet Properties window, double-click Internet Protocol Version 4 (TCP/IPv4) as shown below:


5. Select Use the following DNS server addresses and modify the DNS IP based on the corresponding region in the Private Network Access > Private Network DNS list. After the modification, click OK.


6. If the method above does not work, you can add the following configuration to the C:\\Windows\\System32\\drivers\\etc\\hosts file:
169.254.0.15 update2.agent.tencentyun.com
169.254.0.4 receiver.barad.tencentyun.com
169.254.0.5 custom.message.tencentyun.com
169.254.10.10 metadata.tencentyun.com
7. Run services.msc. Then, right-click the QCloud BaradAgent Monitor and QCloud Stargate Manager services and click Restart the service.


Note:
After fixing the UUID, wait 3 minutes and then go to the CVM console to confirm whether there is monitoring data.
If there is still no monitoring data after the restart, uninstall and reinstall the agents as instructed in Installing CVM Agents.


Step 3. Check whether the UUID is correct

Currently, the incorrect UUID configuration issue occurs only in Linux OS. For details, see the following directions.
1. Log in to the CVM console and go to the instance detail page to view the UUID.


2. Log in to the CVM instance and run the following command to view the UUID:
cat /sys/class/dmi/id/product_serial
If the UUID on the server is different from that displayed in the CVM console, run the following command as the root account to fix the UUID and restart the agents:
echo `cat /etc/uuid |awk -F '= ' '{print $NF}'` > /etc/uuid_to_serial; mount --bind /etc/uuid_to_serial /sys/class/dmi/id/product_serial
cd /usr/local/qcloud/stargate/admin
./restart.sh
cd /usr/local/qcloud/monitor/barad/admin
./stop.sh
./trystart.sh
Note:
After fixing the UUID, wait 3 minutes and then go to the CVM console to confirm whether there is monitoring data.

Step 4. Check the CVM instance log

After the CVM instance is shut down, the instance's agents will be taken offline and thus no data will be reported. When you perform CVM Ops operations such as restart, upgrade, reinstallation, or image creation through the CVM console or through logging in to the CVM instance, the reporting of the CVM monitoring data may time out and the agents will be taken offline.
Troubleshooting: You can access the detail page of the CVM instance and view the operation logs to determine whether any relevant Ops operations were performed on the CVM instance at that time.




Step 5. Check the CVM instance load

The agents may fail to report data if the CPU usage, memory usage, or bandwidth utilization of the CVM instance is too high.
Troubleshooting methods:
High CPU usage: For detailed troubleshooting directions, see CVM CPU/Memory Usage Is Too High.
High memory usage: You can log in to the CVM instance or view the monitoring charts to check whether the CPU usage reaches 100%. If yes, upgrade the CVM configuration as needed.
High bandwidth utilization: For detailed troubleshooting directions, see CVM Bandwidth Utilization Is Too High.

Help and Support

Was this page helpful?

Help us improve! Rate your documentation experience in 5 mins.

Feedback