Course Outline

Module 1: Introduction to the Architecture and Configuration of the Confluent Apache Kafka Cluster

  • Kafka's role in modern data pipelines.
  • Differences between Apache Kafka and Confluent Kafka.
  • Core components: producers, consumers, brokers, topics, and partitions.
  • Kafka cluster deployment models and scaling considerations.

Module 2: ZooKeeper Quorum Configuration

  • Understanding ZooKeeper.
  • ZooKeeper’s role within a Kafka cluster.
  • Determining ZooKeeper quorum size.
  • ZooKeeper configuration guidelines.
  • Setting up SSH access on the servers.
  • Practical exercise: ZooKeeper configuration (as a team and as a service).
  • Utilising the ZooKeeper Command Line Interface (CLI).
  • Practical exercise: ZooKeeper quorum configuration.
  • Exploring the ZooKeeper internal file system.
  • Performance factors affecting ZooKeeper.
  • Demonstration of ZooKeeper management tools, including ZooNavigator.
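
As a rough illustration of the quorum-sizing topic above, the following Python sketch applies the majority rule that ZooKeeper ensembles rely on: an ensemble stays available only while more than half of its servers are up. The function name quorum_profile and the sample ensemble sizes are illustrative assumptions, not part of the course material.

```python
def quorum_profile(ensemble_size: int) -> dict:
    """Return the majority size and tolerated failures for an ensemble.

    Hypothetical helper for illustration: a ZooKeeper ensemble needs a
    strict majority of servers alive to keep serving requests.
    """
    if ensemble_size < 1:
        raise ValueError("ensemble_size must be at least 1")
    majority = ensemble_size // 2 + 1
    return {
        "servers": ensemble_size,
        "majority": majority,
        "tolerated_failures": ensemble_size - majority,
    }


if __name__ == "__main__":
    # Sample sizes chosen only to show that an even-sized ensemble
    # tolerates no more failures than the next smaller odd size.
    for size in (1, 3, 4, 5, 7):
        profile = quorum_profile(size)
        print(
            f"{size} servers: majority of {profile['majority']}, "
            f"tolerates {profile['tolerated_failures']} failure(s)"
        )
```

Under this rule a 4-server ensemble tolerates the same single failure as a 3-server one, which is why odd ensemble sizes are usually preferred.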

Module 3: Kafka Cluster Configuration

  • Foundational Kafka concepts.
  • Kafka configuration parameters.
  • Practical exercise: Configuring Kafka brokers.
  • Practical exercise: Executing Kafka commands.
  • Practical exercise: Configuring a Kafka Multi-Broker Cluster.
  • Practical exercise: Testing the Kafka cluster.
  • Verifying connectivity to the Kafka cluster.
  • Configuring advertised.listeners: the most critical setting.
  • Topic configuration details.
  • Configuration for downloading and ingesting messages into topics.
  • Practical exercise: Demonstrating Kafka resilience.
  • Kafka performance: I/O.
  • Kafka performance: Network.
  • Kafka performance: RAM.
  • Kafka performance: CPU.
  • Kafka performance: Operating System (OS).
  • Kafka performance: Other factors.
  • Practical exercise: Modifying Kafka broker configuration.

Module 4: Advanced Kafka Configuration

  • Configuring the Landoop Kafka Topics UI, Confluent REST Proxy, and Confluent Schema Registry.
  • Sending and receiving messages via CLI, Java, and Spring framework.
  • Monitoring metrics and tools (Confluent Control Center, Elasticsearch, etc.).
  • Managing log files and offsets.
  • Ensuring high availability and disaster recovery.
  • Achieving high availability through replication.
  • Tuning producer and consumer performance.
  • Disaster recovery strategies.
  • Failover control and data recovery techniques.
  • Connector configuration.
  • Implementing Kafka Connect.
  • Implementing Kafka security features.
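
To make the replication topic above more concrete, here is a minimal Python sketch of how many broker failures a topic can absorb. It assumes producers use acks=all and that the topic sets the min.insync.replicas option; the function name replication_tolerance and the example values are hypothetical and not taken from the course.

```python
def replication_tolerance(replication_factor: int, min_insync_replicas: int) -> dict:
    """Estimate failure tolerance for a replicated Kafka partition.

    Illustrative assumptions: producers use acks=all, so a write succeeds
    only while at least min_insync_replicas replicas are in sync, and
    unclean leader election is disabled.
    """
    if not 1 <= min_insync_replicas <= replication_factor:
        raise ValueError("need 1 <= min_insync_replicas <= replication_factor")
    return {
        # Replica losses tolerated before acks=all writes are rejected.
        "write_tolerance": replication_factor - min_insync_replicas,
        # Replica losses tolerated before no copy of the data remains.
        "max_failures_before_data_loss": replication_factor - 1,
    }


if __name__ == "__main__":
    # Example values only: replication factor 3 with min.insync.replicas 2.
    print(replication_tolerance(3, 2))
```

With a replication factor of 3 and min.insync.replicas of 2, the sketch reports that one replica can be lost while writes continue.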

Summary and Next Steps

Requirements

  • Familiarity with distributed systems and messaging concepts.
  • Proficiency with the Linux command line.
  • Basic understanding of networking and system administration.

Audience

  • System administrators.
  • DevOps engineers.
  • Platform and infrastructure teams.

Duration: 21 hours
