tencent cloud

Exception Diagnosis
Last updated: 2025-07-10 15:44:21
Exception Diagnosis
Last updated: 2025-07-10 15:44:21
The exception diagnosis feature provides you with real-time performance monitoring, health inspections, and failure diagnosis, so that you can intuitively know the real-time operation status of database instances, locate newly appeared performance exceptions in real time.

Overview



Viewing Diagnosis Information

1. Log in to the DBbrain Console.
2. In the left sidebar, choose Performance Optimization.
3. Select the corresponding database type and instance ID at the top, and select the Exception Diagnosis tab.
4. On the right side of the page, select to view real-time or historical diagnosis information.

5. View the health score trend chart, diagnosed exception events, and instance architecture diagram within the selected timeline.
View health score trend chart
Click any time point on the trend chart to display the health score.

View diagnosis event bar chart
Hover over the diagnosis event bar chart to display information such as risk level, overview, and start/end time. Click the bar chart to enter the Event Details page to view information including event details, on-site descriptions, intelligent analysis, and optimization suggestions. For more information on viewing event details, see Exception Alarms.


View real-time data for health score and instance architecture diagram.
Health score: Real-time data will be displayed for health score, CPU utilization, memory utilization, connection utilization, read request hit rate, inbound traffic utilization, and outbound traffic utilization.
Click Details under the health score to enter the Health Report page and view the health score, score details, and health report.
Instance architecture diagram: Displays the Proxy and node architecture of the instance, including the location of nodes triggering alerts. Hover over the corresponding node or Proxy to display average metrics for the selected node.


Viewing Diagnosis Prompts

The diagnosis event levels are categorized as healthy, notice, warning, severe, and critical. DBbrain performs regular health checks on the instance every 10 minutes.
1. Log in to the DBbrain Console.
2. In the left sidebar, choose Performance Optimization.
3. Select the corresponding database type and instance ID at the top, and select the Exception Diagnosis tab.
4. On the right side of the page, select to view real-time or historical diagnosis information.
Real-Time: Select real-time to display the risk distribution and diagnosis details for the last three hours.
Historical: Select history to display the risk distribution and diagnosis details for the selected time period.
5. View the diagnosis prompts for the selected time range.

View diagnosis event details
In the Diagnosis Details, click the row of a specific event alarm or hover over the event alarm and click View to enter the Event Details page and view the event details.
Event details mainly include event details, on-site descriptions, intelligent analysis, and optimization suggestions. The event details displayed vary depending on the diagnosis type. Refer to the actual display.
Event Details: They include the diagnosis item, start/end time, risk level, and overview.
Description: They include problem snapshots and performance trends of the exception or health inspection events.

Ignore/Unignore alarms
In the Diagnosis Details, hover over the event alarm and click Ignore to select Ignore this item or Ignore this type, and click OK. You can also ignore alarms on the Event Details page.
Note:
This feature is only for exception alarms with diagnosis items other than "Health Inspection."
Ignore This: It means you can only ignore this alarm.
Ignore This Type: It means you can ignore exception alarms generated from the same root cause.
Ignored diagnosis events will be grayed out. To unignore, you can also click Unignore.

Detailed Description of Diagnosis Items

Diagnosis items related to intelligent diagnosis are categorized into four types: performance, availability, reliability, and maintainability. Each diagnosis item belongs to one category only.
Name of Diagnosis Items
Type of Diagnosis Items
Note:
Risk Level Classification
Node CPU Utilization
Performance
Node CPU utilization is too high.
Critical: node CPU utilization ≥ 95
Serious: 95<node CPU utilization ≥ 90
Alarm: 90<node CPU utilization ≥ 80
Note: 80<node CPU utilization ≥ 60
Node Memory Utilization
Performance
Node memory utilization is too high.
Critical: node memory utilization ≥ 95
Serious: 95<node memory utilization ≥ 90
Alarm: 90<node memory utilization ≥ 80
Note: 80<node memory utilization ≥ 60
Node Connection Utilization
Performance
Node connection utilization is too high.
Critical: Node Connection Utilization Rate ≥ 95
Serious: 95<node connection utilization ≥ 90
Alarm: 90<node connection utilization ≥ 80
Note: 80<node connection utilization ≥ 60
Proxy Connection Utilization
Performance
Proxy connection utilization is too high.
Critical: proxy connection utilization ≥ 95
Serious: 95<proxy connection utilization ≥ 90
Alarm: 90<proxy connection utilization ≥ 80
Note: 80<proxy connection utilization ≥ 60

Proxy Inflow Utilization
Performance
Proxy inbound traffic usage is too high.
Critical: Proxy inbound traffic usage ≥ 1536
Serious: 1536<proxy inbound traffic usage ≥ 1228.8
Alarm: 1228.8<proxy inbound traffic usage ≥ 1024
Note: 1024<proxy inbound traffic usage ≥ 800
Proxy Outflow Utilization
Performance
Proxy outbound traffic usage is too high.
Critical: proxy outbound traffic usage ≥ 1536
Serious: 1536<proxy outbound traffic usage ≥ 1228.8
Alarm: 1228.8<proxy outbound traffic usage ≥ 1024
Note: 1024<proxy outbound traffic usage ≥ 800
Proxy Inflow Limit Occur
Performance
Proxy Inbound Traffic Throttling
Critical
Proxy Outflow Limit Occur
Performance
Proxy Inbound Traffic Throttling
Critical
Error Command
Maintainability
There are error commands detected.
Alarm
Risk
Maintainability
There are high-risk commands detected.
Alarm
Connectivity Health Check
Availability
Database connection error, unable to connect to the database instance.
Critical
Was this page helpful?
You can also Contact Sales or Submit a Ticket for help.
Yes
No

Feedback