Release Notes and Announcements
- Release Notes
- Broker Release Notes
- Announcement
Product Introduction
- Introduction and Selection of the TDMQ Product Series
- What Is TDMQ for CKafka
- Strengths
- Scenarios
- Technology Architecture
- Product Series Introduction
- Apache Kafka Version Support Description
- Comparison with Apache Kafka
- High Availability
- Use Limits
- Regions and AZs
- Related Cloud Services
Billing
Getting Started
- Guide for Getting Started
- Preparations
- VPC Network Access
- Public Domain Name Access
User Guide
- Usage Process Guide
- Configuring Account Permission
- Creating Instance
- Configuring Topic
- Connecting Instance
- Managing Messages
- Managing Consumer Group
- Managing Instance
- Changing Instance Specification
- Configuring Traffic Throttling
- Configuring Elastic Scaling Policy
- Configuring Advanced Features
- Viewing Monitoring Data and Configuring Alarm Rules
- Synchronizing Data Using CKafka Connector
Use Cases
- Cluster Resource Assessment
- Client Practical Tutorial
- Log Integration
- Open-Source Ecosystem Integration
- Replacing Supporting Route (Old)
Migration Guide
- Migration Solution Overview
- Migrating Cluster Using Open-Source Tool
Troubleshooting
- Topics
- Clients
- Messages
API Reference
- History
- Introduction
- API Category
- Making API Requests
- Other APIs
- ACL APIs
- Instance APIs
- Routing APIs
- DataHub APIs
- Topic APIs
- Data Types
- Error Codes
SDK Reference
- SDK Overview
- Java SDK
- Python SDK
- Go SDK
- PHP SDK
- C++ SDK
- Node.js SDK
- SDK for Connector
Security and Compliance
FAQs
- Instances
- Topics
- Consumer Groups
- Client-Related
- Network-Related
- Monitoring
- Messages
Agreements
- CKafka Service Level Agreements
Contact Us
Glossary

High Latency in Client Message Production

Download

포커스 모드

폰트 크기

마지막 업데이트 시간: 2026-01-20 17:19:22

Issue Symptom
The client symptom manifests as increased latency in the producer sending messages.
The message writing speed slows down, and the sending latency increases.
CPU utilization is too high.
Troubleshooting Steps
Step 1: Traffic Throttling Check
Check whether the traffic of a single topic is throttled: If a traffic throttling value is set, it restricts the message sending rate.  
Optimization suggestion: Increase the traffic throttling value for the topic (based on actual business requirements).
﻿
Check whether the traffic of the instance is throttled. If traffic throttling occurs, the bandwidth needs to be increased.
﻿
Step 2: Cluster Performance Check
Check the cluster CPU load. If the overall cluster load is high, it directly causes increased sending latency.
Optimization solution: Scale out.
﻿
Step 3: Client Parameter Optimization
Set reasonable batch parameters to reduce fragmented requests and improve sending performance. Small batches cause clients to initiate requests frequently, which increases server queuing pressure and leads to higher latency and CPU consumption.  
Optimization suggestion:  
Set acks \\batch.size and linger.ms appropriately and adjust the parameters based on business requirements. Recommended values:
acks=1
batch.size=16384
linger.ms=1000
Detailed explanation is as follows:
1. Adjusting the ack parameter 
The acks parameter controls the acknowledgment mechanism that the producer waits after sending messages:
acks=0: No server acknowledgment is required. Performance is high, but the risk of data loss is significant.
acks=1: The primary node returns an acknowledgment upon successful write. Performance is medium. This configuration is suitable for most scenarios.
acks=all or -1: The primary node and all replicas return an acknowledgment after a successful write. Data reliability is high, but performance is poor.
Optimization suggestion:  
To improve sending performance, it is recommended to set acks to 1.
2. Adjusting the batch sending parameter
Batch sending can reduce the number of requests and improve network throughput:
batch.size: Total size of messages cached per partition (in bytes). When the set value is reached, batch sending is triggered.
linger.ms: Maximum time a message stays in the cache. After the value is exceeded, the message will be sent immediately, even if batch.size is not reached.
Recommended configuration:
batch.size=16384 (default value: 16 KB).
linger.ms=1000 (default value: 0, meaning that messages are sent immediately without waiting).
3. Adjusting the cache size
buffer.memory: Controls the overall size of cached messages. When the cache exceeds the limit, it forces a send, regardless of batch.size and linger.ms.
The default value is 32 MB, which can ensure sufficient performance for a single producer.
Recommended configuration: buffer.memory ≧ batch.size * number of partitions * 2.
Note:
If multiple producers are started within the same JVM, set buffer.memory with caution to avoid OOM issues.
Key Issues
1. Causes for high CPU load
The high CPU load of Kafka typically results from the following aspects:
Production and consumption request processing:
High-throughput producers and consumers consume significant CPU resources.
Data replication:
The replica synchronization process requires performing data replication operations.
High-frequency control requests:
For example, high-frequency operations such as offset management and metadata queries consume the CPU resources of brokers.
2. How to handle high cluster load
Common symptoms of high cluster load include:
Message sending latency increases.
Request queue depth is excessively and persistently large.
The CPU or disk I/O of the broker is persistently under high load.
Solution:
2.1 Scale-out:
Increase the number of broker nodes to distribute partition load and reduce pressure on individual servers.
Optimize partition distribution to ensure partitions are evenly distributed across all brokers.
2.2 Traffic throttling:
Set reasonable production traffic throttling values to prevent producer traffic surges from overloading brokers.
3. How to optimize the client implementation for low latency under high cluster load
When the Kafka cluster cannot be scaled out or the load cannot be effectively reduced, you can optimize client configurations to reduce latency as much as possible.
See Client Parameter Optimization.  
Set acks \\batch.size and linger.ms appropriately and adjust the parameters based on business requirements. Recommended values:
acks=1
batch.size=16384
linger.ms=1000
Summary
Through the above troubleshooting steps and parameter optimization methods, you can effectively address high load on Kafka clusters and high latency in message production.
For high cluster load, scale-out is the most direct solution.
For client parameter optimization, adjusting batch sending and cache size can significantly improve performance.
If the issue persists, contact online customer service.
﻿
﻿
﻿

도움말 및 지원

문제 해결에 도움이 되었나요?

더 자세한 내용은 문의하기 또는 티겟 제출 을 통해 문의할 수 있습니다.

피드백

tencent cloud

TDMQ for CKafka

High Latency in Client Message Production

Issue Symptom

Troubleshooting Steps

Step 1: Traffic Throttling Check

Step 2: Cluster Performance Check

Step 3: Client Parameter Optimization

Key Issues

Summary

도움말 및 지원