Real-time data processing requires systems that handle high-throughput message flows with low latency. Apache Kafka is a widely adopted distributed event-streaming platform for building real-time data pipelines, enabling applications to publish, subscribe to, and process event streams at scale.
In this project, we'll set up a complete Kafka streaming pipeline using Apache Kafka and Zookeeper for distributed coordination. Zookeeper manages the Kafka cluster by tracking broker status, topic metadata, and partition assignments, keeping cluster state consistent across distributed nodes. We'll configure both services on localhost, create Kafka topics for message organization, and implement console-based producers and consumers to verify end-to-end data flow through the streaming pipeline.
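As a sketch of the localhost setup, the commands below assume a standard Kafka binary distribution (unpacked into a directory we'll call `kafka/`), which ships with default `zookeeper.properties` and `server.properties` files already pointed at localhost:

```shell
cd kafka

# Start Zookeeper first (default client port 2181),
# using the bundled default configuration.
bin/zookeeper-server-start.sh config/zookeeper.properties &

# Then start the Kafka broker (default listener localhost:9092),
# which registers itself with Zookeeper on startup.
bin/kafka-server-start.sh config/server.properties &
```

Starting Zookeeper before the broker matters: the broker connects to Zookeeper at startup to register itself and fetch cluster metadata, and will fail if Zookeeper is unreachable.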
We'll start by configuring Zookeeper for service synchronization, then launch the Kafka broker and create topics using the Kafka CLI. We'll build a producer to publish messages and a consumer to subscribe and process them, demonstrating the publish-subscribe pattern for event-driven architecture. By the end, we'll have hands-on experience with Kafka setup, Zookeeper configuration, topic management, producer-consumer patterns, and stream processing applicable to any real-time analytics, event streaming, or data integration system.
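The topic-management and producer/consumer steps above can be sketched with the Kafka CLI tools (the topic name `events` is a placeholder; the `--bootstrap-server` flag assumes Kafka 2.2 or later, where the topic tool talks to the broker directly rather than to Zookeeper):

```shell
# Create a single-partition topic on the local broker.
bin/kafka-topics.sh --create --topic events \
  --bootstrap-server localhost:9092 \
  --partitions 1 --replication-factor 1

# Publish messages: each line typed into this console
# is sent to the topic as one record.
bin/kafka-console-producer.sh --topic events \
  --bootstrap-server localhost:9092

# In a second terminal, subscribe and print every record,
# including those published before the consumer started.
bin/kafka-console-consumer.sh --topic events \
  --from-beginning --bootstrap-server localhost:9092
```

Running the producer and consumer in separate terminals makes the publish-subscribe flow visible: lines typed into the producer appear in the consumer, confirming end-to-end delivery through the broker.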