Apache Kafka: intermediate Kafka 2.1

https://www.skillsoft.com/channel/apache-kafka-2f44e500-0b21-11e7-a3e9-a39d8b265364?expertiselevel=43964&technologyandversion=43966 https://www.skillsoft.com/channel/apache-kafka-2f44e500-0b21-11e7-a3e9-a39d8b265364?expertiselevel=43964&technologyandversion=45441102 https://www.skillsoft.com/channel/apache-kafka-2f44e500-0b21-11e7-a3e9-a39d8b265364?expertiselevel=43965&technologyandversion=43966
  • 6 Courses | 5h 6m 55s
  • 3 Books | 12h 24m
  • Includes Lab
  • 5 Courses | 7h 7m 47s
  • 1 Book | 4h 23m
  • 1 Audiobook | 7h 3m 42s
  • 2 Courses | 1h 53m 18s
  • 3 Books | 12h 24m
  • Includes Lab
Rating 5.0 of 1 users Rating 5.0 of 1 users (1)
 
Apache Kafka is an open-source stream processing platform for providing a unified, high-throughput, low-latency and real-time data feeds.

GETTING STARTED

Apache Kafka: Concepts & Components

  • 3m 42s
  • 6m 6s

GETTING STARTED

Processing Data: Getting Started with Apache Kafka

  • 1m 20s
  • 10m 22s

GETTING STARTED

Apache Kafka: Clustering

  • 2m 48s
  • 7m 35s

COURSES INCLUDED

Apache Kafka: Concepts & Components
Apache Kafka is a distributed platform that lets you publish, store, and process streams of records. Here we look at the basic concepts of Kafka, and examine its main components.
8 videos | 33m has Assessment available Badge
Apache Kafka: Deployment & Operating Tasks
To get started with Apache Kafka, you must first deploy it. Let's examine the specification requirements, deployment to various platforms, and some of the main operating tasks you can perform in Kafka.
9 videos | 53m has Assessment available Badge
Apache Kafka: Operations
To maintain peak performance of your Kafka deployment, you must monitor and manage the servers. Learn how to use the tools available for monitoring Kafka, as well as how to perform tuning for Kafka.
9 videos | 36m has Assessment available Badge
Apache Kafka: Development
Apache Kafka comes with a set of APIs for consumers and producers to write to and read from logs. Explore both the producer and consumer APIs, data serialization/deserialization techniques, and strategies for testing Kafka.
18 videos | 1h 14m has Assessment available Badge
Apache Kafka: Integration with Spark
Apache Kafka can easily integrate with Apache Spark to allow processing of the data entered into Kafka. Discover how to integrate Kafka with Spark.
10 videos | 1h 2m has Assessment available Badge
Apache Kafka: Integration with Storm
Apache Kafka can easily integrate with Apache Storm to allow processing of the data entered into Kafka. Discover how to integrate Kafka with Storm.
8 videos | 46m has Assessment available Badge
SHOW MORE
FREE ACCESS

COURSES INCLUDED

Processing Data: Getting Started with Apache Kafka
Apache Kafka is a popular event streaming platform used by Fortune 100 companies for both real-time and batch data processing. In this course, you will explore the characteristics of event streaming and how Kafka architecture allows for scalable streaming data. Install Kafka and create some topics, which are essentially channels of communication between apps and data. Set up and work with multiple topics for durable storage. Create multiple brokers and cluster of nodes to handle messages and store their replicas. Then, monitor the settings and logs for those brokers. Finally, see how topic partitions and replicas provide redundancy and maintain high availability.
11 videos | 1h 31m has Assessment available Badge
Processing Data: Integrating Kafka with Python & Using Consumer Groups
Producers and consumers are applications that write events to and read events from Kafka. In this course, you will focus on integrating Python applications with a Kafka environment, implementing consumer groups, and tweaking Kafka configurations. Begin by connecting to Kafka from Python. You will produce to and consume messages from a Kafka topic using Python. Next, discover how to tweak Kafka broker configurations. You will place limits on the size of messages and disable deletion of topics. Then, publish messages to partitioned topics and explore the use of partitioning algorithms to determine the placement of messages on partitions. Explore consumer groups, which allow a set of consumers to process messages published to partitioned Kafka topics in parallel - without any duplication of effort. Finally, learn different ways to optimize Kafka's performance, using configurations for brokers and topics, as well as producer and consumer apps.
12 videos | 1h 24m has Assessment available Badge
Processing Data: Introducing Apache Spark
Apache Spark is a powerful distributed data processing engine that can handle petabytes of data by chunking that data and dividing across a cluster of resources. In this course, explore Spark's structured streaming engine, including components like PySpark shell. Begin by downloading and installing Apache Spark. Then create a Spark cluster and run a job from the PySpark shell. Monitor an application and job runs from the Spark web user interface. Then, set up a streaming environment, reading and manipulating the contents of files that are added to a folder in real-time. Finally, run apps on both Spark standalone and local modes.
13 videos | 1h 44m has Assessment available Badge
Processing Data: Integrating Kafka with Apache Spark
Flexible and Intuitive, DataFrames are a popular data structure in data analytics. In this course, build Spark applications that process data streamed to Kafka topics using DataFrames. Begin by setting up a simple Spark app that streams in messages from a Kafka topic, processes and transforms them, and publishes them to an output sink. Next, leverage the Spark DataFrame application programming interface by performing selections, projections, and aggregations on data streamed in from Kafka, while also exploring the use of SQL queries for those transformations. Finally, you will perform windowing operations - both tumbling windows, where the windows do not overlap, and sliding windows, where there is some overlapping of data.
12 videos | 1h 45m has Assessment available Badge
Processing Data: Using Kafka with Cassandra & Confluent
Apache Cassandra is a trusted open-source NoSQL distributed database that easily integrates with Apache Kafka as part of an ETL pipeline. This course focuses on that integration of Kafka, Spark and Cassandra and explores a managed version of Kafka with the Confluent data streaming platform. Begin by integrating Kafka with Apache Cassandra as part of an ETL pipeline involving a Spark application. Discover Apache Cassandra and learn the steps involved in linking Spark with this wide-column database. Next, examine the various features of the Confluent platform and find out how easy it is to set up and work with a Kafka environment. After completing this course, you will be prepared to implement and manage steam processing systems in your organization.
7 videos | 41m has Assessment available Badge
SHOW MORE
FREE ACCESS

COURSES INCLUDED

Apache Kafka: Clustering
What makes Apache Kafka so powerful and fault-tolerant is its clustering capabilities. Discover how to create and manage clusters in Kafka.
10 videos | 54m has Assessment available Badge
Apache Kafka: Real-time Applications
A major feature of Apache Kafka is building real-time applications that react to data streams. Discover how to create real-time applications in Kafka.
9 videos | 58m has Assessment available Badge

EARN A DIGITAL BADGE WHEN YOU COMPLETE THESE COURSES

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

BOOKS INCLUDED

Book

Kafka in Action
Starting with an overview of Kafka's core concepts, you'll immediately learn how to set up and execute basic data movement tasks and how to produce and consume streams of events. Advancing quickly, you'll soon be ready to use Kafka in your day-to-day workflow, and start digging into even more advanced Kafka topics.
book Duration 4h 23m book Authors By Dave Klein, Dylan Scott, Viktor Gamov

Book

Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka
Explaining each of the full-stack technologies and, more importantly, how to best integrate them, this book provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation.
book Duration 3h 56m book Authors By Isaac Ruiz, Raul Estrada

Book

Kafka Streams in Action: Real-time Apps and Microservices with the Kafka Streams API
This book teaches you everything you need to know to implement stream processing on data flowing into your Kafka platform, allowing you to focus on getting more from your data without sacrificing time or effort.
book Duration 4h 5m book Authors By William P. Bejeck Jr.

BOOKS INCLUDED

Book

Kafka in Action
Starting with an overview of Kafka's core concepts, you'll immediately learn how to set up and execute basic data movement tasks and how to produce and consume streams of events. Advancing quickly, you'll soon be ready to use Kafka in your day-to-day workflow, and start digging into even more advanced Kafka topics.
book Duration 4h 23m book Authors By Dave Klein, Dylan Scott, Viktor Gamov

BOOKS INCLUDED

Book

Kafka in Action
Starting with an overview of Kafka's core concepts, you'll immediately learn how to set up and execute basic data movement tasks and how to produce and consume streams of events. Advancing quickly, you'll soon be ready to use Kafka in your day-to-day workflow, and start digging into even more advanced Kafka topics.
book Duration 4h 23m book Authors By Dave Klein, Dylan Scott, Viktor Gamov

Book

Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka
Explaining each of the full-stack technologies and, more importantly, how to best integrate them, this book provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation.
book Duration 3h 56m book Authors By Isaac Ruiz, Raul Estrada

Book

Kafka Streams in Action: Real-time Apps and Microservices with the Kafka Streams API
This book teaches you everything you need to know to implement stream processing on data flowing into your Kafka platform, allowing you to focus on getting more from your data without sacrificing time or effort.
book Duration 4h 5m book Authors By William P. Bejeck Jr.

AUDIOBOOKS INCLUDED

Audiobook

Kafka in Action
Master the wicked-fast Apache Kafka streaming platform through hands-on examples and real-world projects.
audiobook Duration 7h 3m 42s audiobook Authors By Dave Klein, Dylan Scott, Viktor Gamov

SKILL BENCHMARKS INCLUDED

Data Infrastructures with Apache Kafka Competency (Intermediate Level)
The Data Infrastructures with Apache Kafka Competency (Intermediate Level) benchmark measures your ability to recognize the basics, use cases, and core concepts of Apache Kafka. You will be evaluated on your skills in building Python applications that produce messages for and consume messages from Kafka topics, creating and using consumer groups, and developing Apache Spark applications that can manipulate streaming data. A learner who scores high on this benchmark demonstrates that they have the ability to work with Apache Kafka with minimal supervision.
18m    |   18 questions
Data Infrastructures with Apache Kafka Proficiency (Advanced Level)
The Data Infrastructures with Apache Kafka Proficiency (Advanced Level) benchmark measures your ability to build Apache Spark applications that process streaming data published to Kafka. You will be evaluated on your skills in applying transformations based on aggregations and window operations, setting up and managing a Kafka environment using Confluent, and defining an ETL pipeline involving Kafka, Spark, and Cassandra. A learner who scores high on this benchmark demonstrates that they have the skills to work on Kafka without any supervision.
16m    |   16 questions

YOU MIGHT ALSO LIKE

Channel Docker
Rating 5.0 of 1 users Rating 5.0 of 1 users (1)
Channel Apache Solr
Rating 5.0 of 1 users Rating 5.0 of 1 users (1)