
Controller Broker

Within a Kafka cluster, one broker is elected as the Controller -- the cluster-wide leader responsible for administrative coordination. This is the Leader and Follower pattern applied at the cluster level.

Think first
In a cluster of multiple brokers, someone needs to make cluster-wide decisions like assigning partition leaders and detecting failures. What would go wrong if every broker tried to make these decisions independently?

What the Controller does

| Responsibility | Detail |
| --- | --- |
| Topic management | Creating/deleting topics, adding partitions |
| Partition leader assignment | Assigns a leader broker to each partition |
| Broker health monitoring | Detects broker failures and triggers failover |
| Leader election | When a broker dies, reassigns its partition leadership to members of the in-sync replica set (ISR) |
| Metadata propagation | Communicates partition leadership changes to all brokers |

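The leader-election responsibility can be sketched in a few lines. This is an illustrative model, not Kafka's actual implementation: partition state is a plain dict, and the policy shown (promote the first surviving ISR member) is a simplification of the real controller's logic.

```python
# Illustrative sketch (not Kafka source): when a broker fails, the controller
# reassigns leadership for every partition that broker led, promoting the
# first surviving member of that partition's ISR.

def reassign_leaders(partitions, failed_broker):
    """partitions: dict of name -> {"leader": broker_id, "isr": [broker_ids]}.
    Returns {partition_name: new_leader} for each reassignment made."""
    changes = {}
    for name, state in partitions.items():
        if state["leader"] != failed_broker:
            continue
        # Drop the dead broker from the ISR, then promote the first survivor.
        survivors = [b for b in state["isr"] if b != failed_broker]
        if survivors:
            state["leader"] = survivors[0]
            state["isr"] = survivors
            changes[name] = survivors[0]
        # With no survivors the partition stays offline until a replica returns.
    return changes

partitions = {
    "orders-0": {"leader": 1, "isr": [1, 2, 3]},
    "orders-1": {"leader": 2, "isr": [2, 3]},
}
print(reassign_leaders(partitions, failed_broker=1))  # {'orders-0': 2}
```

Note that only broker 1's partitions are touched; partitions led by healthy brokers are left alone, which keeps failover disruption proportional to the failure.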
Think first
If only one broker is the controller, what happens when it becomes temporarily unresponsive (say, due to a long garbage collection pause)? The cluster elects a new controller -- but then the old one wakes up. Now you have two brokers both believing they are the controller. What kind of inconsistencies could this cause, and how might you prevent them?

Split-brain: the zombie controller problem

When the controller broker becomes unresponsive (GC pause, network partition), the cluster elects a new controller. But what if the old controller comes back? Now there are two controllers -- a classic split-brain scenario.

Stop-the-world GC pause

During garbage collection, the JVM (Kafka runs on the Java Virtual Machine) can pause all application threads for seconds. During such a pause, the broker appears dead to the rest of the cluster. If the pause exceeds ZooKeeper's session timeout, a new controller is elected -- and when the GC pause ends, the old controller wakes up believing it is still in charge.
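The detection mechanism boils down to a timeout check. The sketch below is illustrative only -- real Kafka delegates liveness to ZooKeeper sessions (or controller heartbeats in KRaft mode), and the timeout value here is made up:

```python
# Illustrative failure detection: a broker is presumed dead once its
# heartbeats have stopped for longer than the session timeout.
# (Hypothetical value; real deployments configure this explicitly.)
SESSION_TIMEOUT_S = 6.0

def broker_alive(last_heartbeat_s: float, now_s: float) -> bool:
    """True while the broker's last heartbeat is within the session timeout."""
    return (now_s - last_heartbeat_s) <= SESSION_TIMEOUT_S

print(broker_alive(100.0, 104.0))   # True  -- a 4 s pause goes unnoticed
print(broker_alive(100.0, 110.5))   # False -- 10.5 s pause: declared dead
```

The key point: the check cannot distinguish "dead" from "paused", which is exactly why a long GC pause triggers a new election even though the old controller will eventually wake up.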

Solution: epoch numbers

Kafka solves split-brain with a generation clock (epoch number):

  1. Every time a new controller is elected, the epoch number increments (old controller had epoch 1, new has epoch 2)
  2. The epoch is included in every request from the controller to other brokers
  3. Brokers reject requests from any controller with a lower epoch than the highest they've seen
  4. The epoch is persisted in ZooKeeper, so it survives restarts

When the zombie controller (epoch 1) tries to issue commands, every broker has already seen epoch 2 and ignores it. The zombie eventually discovers it's been superseded and steps down.
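The fencing rule each broker applies can be sketched in a few lines. This is a minimal model of the idea, not Kafka's actual code; the class and error string are illustrative (Kafka's real error is similar in spirit):

```python
# Minimal sketch of epoch-based fencing (illustrative, not Kafka source).
class Broker:
    def __init__(self):
        self.highest_epoch = 0  # highest controller epoch seen so far

    def handle_controller_request(self, epoch: int, command: str) -> str:
        # A lower epoch than the highest seen can only come from a deposed
        # ("zombie") controller, so the request is rejected outright.
        if epoch < self.highest_epoch:
            return "STALE_CONTROLLER_EPOCH"
        self.highest_epoch = epoch
        return f"applied: {command}"

broker = Broker()
print(broker.handle_controller_request(2, "become leader for p0"))
# -> applied: become leader for p0   (new controller, epoch 2)
print(broker.handle_controller_request(1, "become leader for p1"))
# -> STALE_CONTROLLER_EPOCH          (zombie controller, epoch 1)
```

Note that the broker never needs to know *who* the legitimate controller is -- comparing epochs is enough, which is what makes this a simple, decentralized defense against split-brain.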

Interview angle

Controller broker + split-brain + epoch numbers is a complete story for "How does Kafka handle controller failures?" Walk through: (1) controller dies or pauses, (2) new controller elected with higher epoch, (3) old controller wakes up, (4) its commands are rejected because brokers have seen the higher epoch. This shows you understand both the problem and the solution.

Quiz
A Kafka cluster has a controller with epoch 5. A GC pause makes it unresponsive, and a new controller is elected with epoch 6. The old controller wakes up and tries to reassign Partition 3's leader. What happens?