When you use consumer groups to consume data, the server manages the consumption tasks of all consumers in each consumer group, and automatically balances consumption tasks according to the relationship between topic partitions and consumer counts. At the same time, it also records the consumption progress of each topic partition to avoid repeat consumption. The process by which a consumer group consumes data is as follows:
A consumer group adjusts the consumption tasks for each consumer based on the number of active consumers and topic partitions to maintain consumption balance. At the same time, every consumer saves the consumption progress of each topic partition in order to continue the consumption after a failover and avoid repeat consumption.
For example, a log topic has two consumers. Consumer A consumes partitions 1 and 2, and consumer B consumes partitions 3 and 4. When partition 5 is added after a splitting operation, the consumer group automatically assigns partition 5 to consumer B, as shown in the following figure.
For example, a log topic has two consumers. Consumer A consumes partitions 1, 2, and 3, and consumer B consumes partitions 4, 5, and 6. In order for the consumption rate to keep up with the production rate, consumer C is added. The consumer group assigns partitions 3 and 4 to consumer C for balance, as shown in the following figure.
When the order field of a consumption group is set to true, the consumption group will consume partitions in chronological order. For example, a log topic has three partitions. After partition 3 is split into two new partitions, the consumer group first consumes data in partition 3 and then consumes data in the two new partitions derived from partition 3 in parallel, as shown in the following figure.
Was this page helpful?