Distributed Coordination in Kafka
Learn how producers and consumers operate in distributed systems.
There are three ways a producer can publish messages to partitions. It can either publish a message to a particular partition, a random partition, or by selecting a partition by applying a partition function on a message key. In this lesson, we’ll see how the consumer will interact with multiple brokers in parallel.
Coordination in consumer groups
Certain features are there in Kafka to help achieve scalability goals. Those features are explained below.
Goals of coordination
Evenly distribute the messages (stored in the brokers) across multiple consumers.
Concede the lowest possible coordination overhead.
Consumption of messages
Partitions are made to be the smallest unit of parallelism in Kafka. Each smallest unit of parallelism needs to be consumed only by a single consumer. This means only one consumer consumes all the messages in a specific partition of a topic.
If multiple consumers were allowed to consume a partition, they would have had to communicate and decide which messages would be consumed by whom, incurring a locking and state maintenance overhead.
Level up your interview prep. Join Educative to access 70+ hands-on prep courses.