The Fundamentals of Apache Kafka Architecture

Apache Kafka Architecture Deep Dive


Overview of Apache Kafka Architecture

The storage layer is designed to store data efficiently. It is a distributed system, so if your storage needs grow over time you can easily scale out to accommodate the growth.

The compute layer consists of four core components: the producer, consumer, Kafka Streams, and Connect APIs, which allow Kafka to scale applications across distributed systems.


An event is a record of something that happened, along with information describing what happened.

Examples of events are customer orders, payments, clicks on a website, or sensor readings.

Kafka Event

In a Kafka-based architecture, an event record consists of a timestamp, a key, a value, and optional headers.
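To make the record structure concrete, here is a minimal sketch of such an event record as a plain data class. This is an illustration only, not the actual Kafka client classes; the field names are assumptions chosen to mirror the four parts listed above.

```python
from dataclasses import dataclass, field
from typing import Optional

# Illustrative sketch of a Kafka-style event record:
# a timestamp, a key, a value, and optional headers.
@dataclass
class EventRecord:
    timestamp_ms: int                  # when the event occurred or was appended
    key: Optional[bytes]               # used for partitioning; may be None
    value: bytes                       # the event payload
    headers: dict = field(default_factory=dict)  # optional metadata

# Example: a customer-order event.
order = EventRecord(
    timestamp_ms=1_700_000_000_000,
    key=b"customer-42",
    value=b'{"order_id": 1001, "amount": 19.99}',
    headers={"source": b"web-checkout"},
)
print(order.key)
```

The key is optional in Kafka; records without a key can be spread across partitions, while keyed records are routed by key.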


Kafka Topics

Topics are append-only, immutable logs of events. Typically, events of the same type, or events that are in some way related, would go into the same topic.

In order to distribute the storage and processing of events in a topic, Kafka uses the concept of partitions. A topic is made up of one or more partitions, and these partitions can reside on different nodes in the Kafka cluster.
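A common way to assign keyed events to partitions is to hash the key and take the result modulo the partition count, so all events with the same key land in the same partition. The sketch below illustrates the idea; it uses CRC-32 to stay dependency-free, whereas Kafka's default partitioner uses a different hash (murmur2), so the exact partition numbers will differ.

```python
import zlib

def partition_for(key: bytes, num_partitions: int) -> int:
    # Hash the key, then map it onto one of the partitions.
    # CRC-32 is a stand-in here; Kafka's default partitioner uses murmur2.
    return zlib.crc32(key) % num_partitions

# Events with the same key always map to the same partition,
# which preserves per-key ordering within that partition.
p1 = partition_for(b"customer-42", num_partitions=6)
p2 = partition_for(b"customer-42", num_partitions=6)
print(p1 == p2)  # True
```

Because the assignment depends only on the key and the partition count, ordering is guaranteed per key within a partition, but not across partitions.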

Kafka Topic Partitions

Within a partition, each event is given a unique identifier called an offset. The offset for a given partition continues to grow as events are added, and offsets are never reused.
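The append-and-offset behavior can be sketched as a toy in-memory log. This is not how Kafka stores data on disk; it only demonstrates that each appended event receives the next monotonically increasing offset, and that earlier offsets remain readable and are never reassigned.

```python
class PartitionLog:
    """Toy append-only log for one partition: each appended event
    gets the next offset; offsets grow monotonically and are never reused."""

    def __init__(self):
        self._events = []

    def append(self, event) -> int:
        offset = len(self._events)   # next offset = current end of the log
        self._events.append(event)
        return offset

    def read(self, offset):
        # Existing events stay addressable by their original offset.
        return self._events[offset]

log = PartitionLog()
print(log.append("order-created"))     # 0
print(log.append("payment-received"))  # 1
print(log.read(0))                     # order-created
```

Consumers track their position in a partition by remembering the last offset they have processed, which is what makes replaying a topic from an earlier point possible.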

