Release Notes and Announcements
- Release Notes
- Announcements
User Tutorial
Product Introduction
- Overview
- Product Strengths
- Use Cases
- Storage Engine
- Product Series
- Product Versions
- Specifications and Performance
- Read/Write Separation
- Multi-AZ Deployment
- Regions and AZs
- Terms
- Service Regions and Service Providers
Purchase Guide
Getting Started
- Quickly Creating an Instance
- Connecting to an Instance (Redis/Valkey Edition)
Operation Guide
- Operation Overview
- Connecting to a Database Instance
- Managing Instances
- Upgrade Instance
- Management Node (Redis/ValKey Edition)
- Multi-AZ Deployment Management
- Backup and Restoration
- Managing Accounts
- Parameter Configuration
- Slow Query
- Access Management
- Network and Security
- Monitoring and Alarms
- Event Management (Redis/ValKey Edition)
- Data Migration
- Global Replication for Redis Edition
- Database Audit
- Performance Optimization
- Sentinel Mode
Development Guidelines
- Naming Rules
- Basic Usage Guidelines
- Design Principles of Key and Value
- Command Usage Guidelines
- Design Principles of Client Programs
- Connection Pool Configuration
Command Reference
- Command Reference Overview
- Redis Edition and Valkey Edition Command Compatibility
- Version Command Usage Differences
- Differences Between the Proxy Architecture and Direct Connection Mode
- More Command Operations (Redis/Valkey Edition)
- Memcached Edition Command Compatibility
Practical Tutorial
- Building TencentDB for Redis® Client Monitoring Based on Spring Boot
- Redis Client Connection Configuration Policy and Practice
- Global SCAN Guide for Cluster Architecture
- Eliminating Instances Securely
- Hot Key and Big Key
- AZ Migration Scheme
Troubleshooting
- Connection Exception
- Exception Analysis and Solution of Redisson Client Timeout Reconnection
- Performance Troubleshooting and Fine-Tuning
API Documentation
- History
- Introduction
- API Category
- Making API Requests
- Instance APIs
- Parameter Management APIs
- Other APIs
- Backup and Restoration APIs
- Region APIs
- Monitoring and Management APIs
- Log APIs
- Data Types
- Error Codes
FAQs
Service Agreement
- Service Level Agreement
- Terms of Service
Contact Us
Glossary

Exception Diagnosis

Download

フォーカスモード

フォントサイズ

最終更新日: 2026-03-19 16:56:26

The exception diagnosis feature provides real-time performance monitoring, health inspection, and failure diagnosis for your database instances. It allows you to intuitively perceive the real-time operation status and locate performance exceptions as they occur.
Overview
﻿
Viewing the Diagnostic Information
1. Log in to the TencentDB for DBbrain (DBbrain) console.
2. In the left sidebar, select Performance Optimization.
3. At the top of the page, select the database type and instance ID, and select the Exception Diagnosis tab.
4. On the right side of the page, select to view real-time or historical diagnosis information.
﻿
5. View the health score trend chart, diagnosed exception events, and instance architecture diagram within the selected time period.
Viewing the health score trend chart
Click any time point on the trend chart to display the health score at that time point.
﻿
Viewing the diagnosis event bar chart
Hover the cursor over the diagnosis event bar chart to display information such as risk level, overview, and start/end time. Click the bar chart to go to the event details page to view information such as event details, description, intelligent analysis, and optimization suggestions. For more information about how to view event details, see Exception Alarms.
﻿
﻿
Viewing real-time data for health score and instance architecture diagram
Health score: Real-time data will be displayed for health score, CPU utilization, memory utilization, connection utilization, read request hit rate, inbound traffic utilization, and outbound traffic utilization.
You can click Details under the health score to go to the health report page and view the health score, score details, and health report.
Instance architecture diagram: displays the proxy and node architecture of the instance, including the location of nodes triggering alerts. Hover the cursor over a node or proxy to display the corresponding average metric values.
﻿
Viewing the Diagnostic Prompts
Diagnosis events are categorized into Healthy, Note, Alarm, Serious, and Critical levels. DBbrain performs regular health inspections on instances every 10 minutes.
1. Log in to the DBbrain console.
2. In the left sidebar, select Performance Optimization.
3. At the top of the page, select the database type and instance ID, and select the Exception Diagnosis tab.
4. On the right side of the page, select to view real-time or historical diagnosis information.
Real-time: Select Real-time to display the risk distribution and diagnosis details for the last three hours.
Historical: Select Historical to display the risk distribution and diagnosis details for the selected time period.
5. View the diagnostic prompts for the selected time range.
﻿
﻿Viewing the diagnosis event details
In the Diagnosis Details section, click the row of a specific event alarm, or hover the cursor over the event alarm and click View to go to the event details page and view the event details.
Event details mainly include event details, description, intelligent analysis, and optimization suggestions. The event details displayed vary depending on the diagnosis type. The details actually displayed shall prevail.
Event details: includes information such as diagnosis item, start/end time, risk level, and overview.
Description: includes information such as problem snapshots and performance trends of the exception or health inspection event.
﻿
Ignoring or unignoring an alarm
Note:
This feature is only applicable to exception alarms for which the diagnosis item is not Health Inspection.
In the Diagnosis Details section, hover the cursor over an event alarm, click Ignore, and select Ignore This Item or Ignore This Type, and then click OK.
Ignore This Item: ignores only this specific alarm.
Ignore This Type: ignores exception alarms caused by the same root cause.
Ignored diagnosis events will be displayed in gray.
To unignore, click Unignore and then click OK.
To display ignored alarms, select Show Ignored.
Click Ignore Management to view ignored content and ignored types. 
You can go to the event details page and click Ignore or Unignore in the upper-right corner. 
Detailed Description of Diagnosis Items
Diagnosis items are the objects for intelligent diagnosis. Their types include performance, availability, reliability, and maintainability. Each diagnosis item belongs to only one type.
Diagnosis Item Name
Diagnosis Item Type
Description
Risk level classification
Node CPU utilization
Performance
Excessively high CPU utilization of a node
Critical: Node CPU utilization ≥ 95
Serious: 90 ≤ Node CPU utilization < 95
Alarm: 80 ≤ Node CPU utilization < 90
Note: 60 ≤ Node CPU utilization < 80
Node memory utilization
Performance
Excessive memory utilization of a node
Critical: Node memory utilization ≥ 95
Serious: 90 ≤ Node memory utilization < 95
Alarm: 80 ≤ Node memory utilization < 90
Note: 60 ≤ Node memory utilization < 80
Node connection utilization
Performance
Excessive connection utilization of a node
Critical: Node connection utilization ≥ 95
Serious: 90 ≤ Node connection utilization < 95
Alarm: 80 ≤ Node connection utilization < 90
Note: 60 ≤ Node connection utilization < 80
Proxy connection utilization
Performance
Excessive connection utilization of a proxy
Critical: Proxy connection utilization ≥ 95
Serious: 90 ≤ Proxy connection utilization < 95
Alarm: 80 ≤ Proxy connection utilization < 90
Note: 60 ≤ Proxy connection utilization < 80
﻿
Proxy inbound traffic usage
Performance
Excessive inbound traffic usage of a proxy
Critical: Proxy inbound traffic usage ≥ 1536
Serious: 1228.8 ≤ Proxy inbound traffic usage < 1536 
Alarm: 1024 ≤ Proxy inbound traffic usage < 1228.8
Note: 800 ≤ Proxy inbound traffic usage < 1024
Proxy outbound traffic usage
Performance
Excessive outbound traffic usage of a proxy
Critical: Proxy outbound traffic usage ≥ 1536
Serious: 1228.8 ≤ Proxy outbound traffic usage < 1536 
Alarm: 1024 ≤ Proxy outbound traffic usage < 1228.8
Note: 800 ≤ Proxy outbound traffic usage < 1024
Proxy inbound traffic throttling
Performance
Proxy inbound traffic throttling
Critical
Proxy outbound traffic throttling
Performance
Proxy outbound traffic throttling
Critical
Incorrect commands
Maintainability
Existence of incorrect commands
Alarm
High-risk commands
Maintainability
Existence of high-risk commands
Alarm
Connectivity check
Availability
Database connection exception: The database instance cannot be connected
Critical

ヘルプとサポート

この記事はお役に立ちましたか？

営業担当者にお問い合わせいただくかチケットを提出してサポートを求めることができます。

フィードバック

tencent cloud

Tencent Cloud Distributed Cache (Redis OSS-Compatible)

Exception Diagnosis

Overview

Viewing the Diagnostic Information

Viewing the Diagnostic Prompts

Detailed Description of Diagnosis Items

ヘルプとサポート

Diagnosis Item Name	Diagnosis Item Type	Description	Risk level classification
Node CPU utilization	Performance	Excessively high CPU utilization of a node	Critical: Node CPU utilization ≥ 95 Serious: 90 ≤ Node CPU utilization < 95 Alarm: 80 ≤ Node CPU utilization < 90 Note: 60 ≤ Node CPU utilization < 80
Node memory utilization	Performance	Excessive memory utilization of a node	Critical: Node memory utilization ≥ 95 Serious: 90 ≤ Node memory utilization < 95 Alarm: 80 ≤ Node memory utilization < 90 Note: 60 ≤ Node memory utilization < 80
Node connection utilization	Performance	Excessive connection utilization of a node	Critical: Node connection utilization ≥ 95 Serious: 90 ≤ Node connection utilization < 95 Alarm: 80 ≤ Node connection utilization < 90 Note: 60 ≤ Node connection utilization < 80
Proxy connection utilization	Performance	Excessive connection utilization of a proxy	Critical: Proxy connection utilization ≥ 95 Serious: 90 ≤ Proxy connection utilization < 95 Alarm: 80 ≤ Proxy connection utilization < 90 Note: 60 ≤ Proxy connection utilization < 80
Proxy inbound traffic usage	Performance	Excessive inbound traffic usage of a proxy	Critical: Proxy inbound traffic usage ≥ 1536 Serious: 1228.8 ≤ Proxy inbound traffic usage < 1536 Alarm: 1024 ≤ Proxy inbound traffic usage < 1228.8 Note: 800 ≤ Proxy inbound traffic usage < 1024
Proxy outbound traffic usage	Performance	Excessive outbound traffic usage of a proxy	Critical: Proxy outbound traffic usage ≥ 1536 Serious: 1228.8 ≤ Proxy outbound traffic usage < 1536 Alarm: 1024 ≤ Proxy outbound traffic usage < 1228.8 Note: 800 ≤ Proxy outbound traffic usage < 1024
Proxy inbound traffic throttling	Performance	Proxy inbound traffic throttling	Critical
Proxy outbound traffic throttling	Performance	Proxy outbound traffic throttling	Critical
Incorrect commands	Maintainability	Existence of incorrect commands	Alarm
High-risk commands	Maintainability	Existence of high-risk commands	Alarm
Connectivity check	Availability	Database connection exception: The database instance cannot be connected	Critical