Tags
Language
Tags
October 2025
Su Mo Tu We Th Fr Sa
28 29 30 1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31 1
    Attention❗ To save your time, in order to download anything on this site, you must be registered 👉 HERE. If you do not have a registration yet, it is better to do it right away. ✌

    ( • )( • ) ( ͡⚆ ͜ʖ ͡⚆ ) (‿ˠ‿)
    SpicyMags.xyz

    Data Engineering using Kafka and Spark Structured Streaming

    Posted By: Sigha
    Data Engineering using Kafka and Spark Structured Streaming

    Data Engineering using Kafka and Spark Structured Streaming
    MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz
    Language: English (US) | Size: 6.53 GB | Duration: 9h 35m

    A comprehensive Data Engineering course on building streaming pipelines using Kafka and Spark Structured Streaming

    What you'll learn
    Setting up self support lab with Hadoop (HDFS and YARN), Hive, Spark, and Kafka
    Overview of Kafka to build streaming pipelines
    Data Ingestion to Kafka topics using Kafka Connect using File Source
    Data Ingestion to HDFS using Kafka Connect using HDFS 3 Connector Plugin
    Overview of Spark Structured Streaming to process data as part of Streaming Pipelines
    Incremental Data Processing using Spark Structured Streaming using File Source and File Target
    Integration of Kafka and Spark Structured Streaming - Reading Data from Kafka Topics

    Requirements
    Laptop with decent configuration
    Decent internet speed to watch the lessons
    Self Support lab (instructions will be provided as part of the course) or ITVersity labs
    Knowledge about Functional Programming (preferably Python or Scala)
    Knowledge or experience using Spark

    Description
    As part of this course, you will be learning to build streaming pipelines by integrating Kafka and Spark Structured Streaming. Let us go through the details about what is covered in the course.First of all, we need to have the proper environment to build streaming pipelines using Kafka and Spark Structured Streaming on top of Hadoop or any other distributed file system. As part of the course, you will start with setting up a self-support lab with all the key components such as Hadoop, Hive, Spark, and Kafka on a single node Linux-based system.Once the environment is set up you will go through the details related to getting started with Kafka. As part of that process, you will create a Kafka topic, produce messages into the topic as well as consume messages from the topic.You will also learn how to use Kafka Connect to ingest data from web server logs into Kafka topic as well as ingest data from Kafka topic into HDFS as a sink.Once you understand Kafka from the perspective of Data Ingestion, you will get an overview of some of the key concepts of related Spark Structured Streaming.After learning Kafka and Spark Structured streaming separately, you will build a streaming pipeline to consume data from Kafka topic using Spark Structured Streaming, then process and write to different targets.You will also learn how to take care of incremental data processing using Spark Structured Streaming.Course OutlineHere is a brief outline of the course. You can choose either Cloud9 or GCP to provision a server to set up the environment.Setting up Environment using AWS Cloud9 or GCPSetup Single Node Hadoop ClusterSetup Hive and Spark on top of Single Node Hadoop ClusterSetup Single Node Kafka Cluster on top of Single Node Hadoop ClusterGetting Started with KafkaData Ingestion using Kafka Connect - Web server log files as a source to Kafka TopicData Ingestion using Kafka Connect - Kafka Topic to HDFS a sinkOverview of Spark Structured StreamingKafka and Spark Structured Streaming IntegrationIncremental Loads using Spark Structured StreamingUdemy based supportIn case you run into technical challenges while taking the course, feel free to raise your concerns using Udemy Messenger. We will make sure that issue is resolved in 48 hours.

    Who this course is for:
    Experienced ETL Developers who want to learn Kafka and Spark to build streaming pipelines, Experienced PL/SQL Developers who want to learn Kafka and Spark to build streaming pipelines, Beginner or Experienced Data Engineers who want to learn Kafka and Spark to build streaming pipelines


    Data Engineering using Kafka and Spark Structured Streaming


    For More Courses Visit & Bookmark Your Preferred Language Blog
    From Here: English - Français - Italiano - Deutsch - Español - Português - Polski - Türkçe - Русский