tencent cloud

TDMQ for CKafka

Release Notes and Announcements
Release Notes
Broker Release Notes
Announcement
Product Introduction
Introduction and Selection of the TDMQ Product Series
What Is TDMQ for CKafka
Strengths
Scenarios
Technology Architecture
Product Series Introduction
Apache Kafka Version Support Description
Comparison with Apache Kafka
High Availability
Use Limits
Regions and AZs
Related Cloud Services
Billing
Billing Overview
Pricing
Billing Example
Changing from Postpaid by Hour to Monthly Subscription
Renewal
Viewing Consumption Details
Overdue Payments
Refund
Getting Started
Guide for Getting Started
Preparations
VPC Network Access
Public Domain Name Access
User Guide
Usage Process Guide
Configuring Account Permission
Creating Instance
Configuring Topic
Connecting Instance
Managing Messages
Managing Consumer Group
Managing Instance
Changing Instance Specification
Configuring Traffic Throttling
Configuring Elastic Scaling Policy
Configuring Advanced Features
Viewing Monitoring Data and Configuring Alarm Rules
Synchronizing Data Using CKafka Connector
Use Cases
Cluster Resource Assessment
Client Practical Tutorial
Log Integration
Open-Source Ecosystem Integration
Replacing Supporting Route (Old)
Migration Guide
Migration Solution Overview
Migrating Cluster Using Open-Source Tool
Troubleshooting
Topics
Clients
Messages
​​API Reference
History
Introduction
API Category
Making API Requests
Other APIs
ACL APIs
Instance APIs
Routing APIs
DataHub APIs
Topic APIs
Data Types
Error Codes
SDK Reference
SDK Overview
Java SDK
Python SDK
Go SDK
PHP SDK
C++ SDK
Node.js SDK
SDK for Connector
Security and Compliance
Permission Management
Network Security
Deletion Protection
Event Record
CloudAudit
FAQs
Instances
Topics
Consumer Groups
Client-Related
Network-Related
Monitoring
Messages
Agreements
CKafka Service Level Agreements
Contact Us
Glossary
문서TDMQ for CKafkaUser GuideSynchronizing Data Using CKafka ConnectorViewing Monitoring Data and Configuring Alarm Rules

Viewing Monitoring Data and Configuring Alarm Rules

PDF
포커스 모드
폰트 크기
마지막 업데이트 시간: 2026-01-20 17:02:41

Scenarios

The TDMQ for CKafka (CKafka) connector supports daily monitoring of data synchronization tasks under your account. You can view detailed monitoring data in real time in the console to understand the health status of the tasks. You can also configure alarm rules for key metrics. When monitoring metrics reach the set alarm threshold, Tencent Cloud Observability Platform (TCOP) will notify you via email, Short Message Service (SMS), WeChat, or phone, helping you promptly detect cluster issues and handle them while ensuring the stable running of the tasks.

Monitoring Metrics and Meanings

Metric Type
Task Metric
Meaning
Unit
Read side
SetSourceRecordPollRate
Number of messages read per second from a Kafka topic at the data source.
Count/s
SetUpstreamBytesPollRate
Number of messages read per second from a Kafka topic at the data source.
MB/s
SetTotalRecordErrors
Number of messages that failed to be read from a Kafka topic at the data source.
Count/s
Write side
SetSourceRecordWriteRate
Number of messages written per second to the data target.
Count/s
SetDownstreamBytesSendRate
Total traffic of messages written per second to the data target.
MB/s
SetPollBatchMaxTimeMs
Maximum time difference from message reading to completion of writing
ms
Task performance metric
SetConnectorTasksMax
Number of active concurrent tasks.
Count
SetTaskHealthy
1 indicates healthy; 0 indicates unhealthy (such as exception or failure).
None

Viewing Monitoring Data

1. Log in to the CKafka console.
2. Choose Connector > Task List and click the ID of the target task to go to the basic information page.
3. On the basic information page, select the Monitoring tab at the top and set the time range to view the corresponding monitoring metrics.
On the monitoring page, you can perform the following operations:
Operation
Icon
Description
Comparing time



You can click the week-on-week and day-on-day comparison icon to select week-on-week (same period last week), day-on-day (same period yesterday), or a custom date range to compare changes in the cluster status across different time periods.
Setting the refresh interval



You can click the refresh icon on the left side to update the chart. Click the drop-down menu on the right side to select the automatic refresh interval for the entire chart. You can set the interval to 1 minute or 5 minutes.
Replicating a chart to the dashboard



You can click the icon to replicate a chart to the dashboard. For more information about the dashboard, see What Is a Dashboard.
Displaying the legend



You can check this box to display legend information in the chart.

Alarm Configuration Recommendations

This section describes key metrics that require special attention and recommended alarm configurations during the use of the CKafka connector:
Metric
Alarm Configuration Recommendation
Alarm Handling Recommendation
set_total_record_errors
Set the statistical granularity to 1 minute. If the value of Data Read Failures is greater than 100 counts/s for 3 consecutive data points, trigger an alarm every 15 minutes.
1. Check whether the upstream systems are running properly and whether data can be read normally.
2. Check whether the upstream systems have network issues, as network failures may cause data to be unreadable.
3. Check whether the data format of upstream systems has been adjusted.
If the exception persists after you exclude the above reasons, contact us.
set_task_healthy
Set the statistical granularity to 1 minute. If the value of Task Health Status is 0 for 3 consecutive data points, trigger an alarm every 15 minutes.
Check whether the upstream and downstream systems are running properly. If they are running properly and data can be read and written normally, but the task status is abnormal, contact us.
set_source_resource_health
Set the statistical granularity to 1 minute. If the value of Source Connection Health Status is 0 for 3 consecutive data points, trigger an alarm every 15 minutes.
Check whether the upstream (source) systems provide services properly. If they provide services normally and the network status is normal, but the connection status is abnormal, contact us.
Target connection health status
Set the statistical granularity to 1 minute. If the value of Target Connections is 0 for 3 consecutive data points, trigger an alarm every 15 minutes.
Check whether the downstream (target) systems provide services normally. If they provide services normally and the network status is normal, but the connection status is abnormal, contact us.

Configuring an Alarm Policy

1. You can select either of the following two entries on the alarm page:
Entry 1: Log in to the CKafka console. Choose Connector > Task List and click the ID of the target task to go to the basic information page. Select the Monitoring tab at the top. Click the alarm configuration icon in the upper-right corner of the monitoring chart, and you will be redirected to the Alarm Configuration page. The alarm recipient is the task type before redirection by default.

Entry 2: Log in to the TCOP console. On the Alarm Configuration page, click Create Policy. Set the monitoring type to Cloud Product Monitoring and set the policy type to ckafka_connector_set.

2. Select the task object for which you want to set an alarm in Alarm Object.
3. Set alarm trigger conditions. Select Template and Configure Manually are supported. By default, Configure Manually is selected.
Configure Manually
Select Template
Metric: Taking the set_task_healthy for example, set the statistical granularity to 1 minute. If the value of Task Health Status exceeds the threshold for N consecutive data points within 1 minute, an alarm will be triggered.
Alarm Frequency: For example, "Alarm every 30 minutes" indicates that an alarm is triggered once every 30 minutes if a metric exceeds the threshold in multiple consecutive statistical periods. Another alarm will be triggered only if the metric exceeds the threshold again in the next 30 minutes.
1. Select Select Template. Then, click Create Trigger Condition Template to redirect to the trigger condition template setup page.
2. In the upper-left corner, click New Trigger Condition Template. On the template creation page, configure the alarm policy.
Policy Type: Select a policy type under ckafka_connector_task.
Trigger Condition: Set the alarm policy based on the recommendations we provide or your actual business requirements.
3. After confirming that everything is correct, click Save and return to the alarm policy creation page. Click Refresh, and the configured alarm policy template will then appear.
Note:
For more information about the alarm configuration feature, see Configuring Metric Alarms.
4. Click Next step: Configure Alarm Notification to configure alarm recipients.
You can select a notification template preset by the system. The alarm recipient of a preset template is typically the person in charge of the root account. To notify the person in charge of the instance or other personnel, you can also click Add Notification Template to create a notification template and set alarm recipients and notification channels.
For detailed operations about how to create a notification template, see Creating a Notification Template.
5. After confirming that the information is correct, click Complete to complete the configuration of alarm rules.



도움말 및 지원

문제 해결에 도움이 되었나요?

피드백