Skip to content

Widhian Bramantya

coding is an art form

Menu
  • About Me
Menu
kafka

Kafka Architecture Explained: Brokers, Topics, Partitions, and Offsets

Posted on September 14, 2025September 14, 2025 by admin

Apache Kafka may look complex at first, but its main building blocks are simple. If you understand brokers, topics, partitions, and offsets, you can understand how Kafka works.

What is a Broker?

A broker is a Kafka server.

  • Each broker stores messages and serves producers and consumers.
  • Kafka is usually run as a cluster with many brokers working together.
  • If one broker goes down, the others keep the system running.

Think of a broker as a post office that receives and delivers messages.

What is a Topic?

A topic is like a folder or category of messages.

  • Producers send messages into a topic.
  • Consumers read messages from a topic.
  • Example:
    • Topic orders stores all purchase events.
    • Topic users stores all user registration events.

What is a Partition?

Each topic can be split into partitions.

  • A partition is an ordered log where messages are stored in sequence.
  • Instead of putting all messages in a single log, Kafka splits them into partitions.
  • Example: Topic orders may have 3 partitions:
    • orders-0
    • orders-1
    • orders-2

Why Do We Need Partitions?

  1. Scalability
    • If there was only 1 partition, only 1 consumer could read it at a time.
    • With many partitions, Kafka can split work across multiple consumers in a consumer group.
    • This means more throughput and faster processing.
  2. Distribution
    • Partitions can be stored on different brokers (servers).
    • This spreads the load across the cluster, instead of putting all the pressure on one machine.
    • Result: Kafka can handle huge volumes of data.
  3. Replication and Fault Tolerance
    • Each partition can have replicas on different brokers.
    • If one broker fails, another broker has a copy and can continue serving data.
    • This makes Kafka highly reliable in a distributed system.
See also  What is Debezium? – An Introduction to Change Data Capture

👉 In short: Partitions make Kafka both scalable and fault-tolerant. Without partitions, Kafka would be just a single log, and it would not be able to handle big data at large scale.

Pages: 1 2 3
Category: Kafka

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Linkedin

Widhian Bramantya

Recent Posts

  • Log Management at Scale: Integrating Elasticsearch with Beats, Logstash, and Kibana
  • Index Lifecycle Management (ILM) in Elasticsearch: Automatic Data Control Made Simple
  • Blue-Green Deployment in Elasticsearch: Safe Reindexing and Zero-Downtime Upgrades
  • Maintaining Super Large Datasets in Elasticsearch
  • Elasticsearch Best Practices for Beginners
  • Implementing the Outbox Pattern with Debezium
  • Production-Grade Debezium Connector with Kafka (Postgres Outbox Example – E-Commerce Orders)
  • Connecting Debezium with Kafka for Real-Time Streaming
  • Debezium Architecture – How It Works and Core Components
  • What is Debezium? – An Introduction to Change Data Capture
  • Offset Management and Consumer Groups in Kafka
  • Partitions, Replication, and Fault Tolerance in Kafka
  • Delivery Semantics in Kafka: At Most Once, At Least Once, Exactly Once
  • Producers and Consumers: How Data Flows in Kafka
  • Kafka Architecture Explained: Brokers, Topics, Partitions, and Offsets
  • Getting Started with Apache Kafka: Core Concepts and Use Cases
  • Security Best Practices for RabbitMQ in Production
  • Understanding RabbitMQ Virtual Hosts (vhosts) and Their Uses
  • RabbitMQ Performance Tuning: Optimizing Throughput and Latency
  • High Availability in RabbitMQ: Clustering and Mirrored Queues Explained

Recent Comments

  1. Playing with VPC AWS (Part 2) – Widhian's Blog on Playing with VPC AWS (Part 1): VPC, Subnet, Internet Gateway, Route Table, NAT, and Security Group
  2. Basic Concept of ElasticSearch (Part 3): Translog, Flush, and Refresh – Widhian's Blog on Basic Concept of ElasticSearch (Part 1): Introduction
  3. Basic Concept of ElasticSearch (Part 2): Architectural Perspective – Widhian's Blog on Basic Concept of ElasticSearch (Part 3): Translog, Flush, and Refresh
  4. Basic Concept of ElasticSearch (Part 3): Translog, Flush, and Refresh – Widhian's Blog on Basic Concept of ElasticSearch (Part 2): Architectural Perspective
  5. Basic Concept of ElasticSearch (Part 1): Introduction – Widhian's Blog on Basic Concept of ElasticSearch (Part 2): Architectural Perspective

Archives

  • October 2025
  • September 2025
  • August 2025
  • November 2021
  • October 2021
  • August 2021
  • July 2021
  • June 2021
  • March 2021
  • January 2021

Categories

  • Debezium
  • Devops
  • ElasticSearch
  • Golang
  • Kafka
  • Lua
  • NATS
  • Programming
  • RabbitMQ
  • Redis
  • VPC
© 2025 Widhian Bramantya | Powered by Minimalist Blog WordPress Theme