Data Engineering on Microsoft Azure: Databrick Processing

Azure 2021    |    Intermediate
  • 11 Videos | 1h 53m 42s
  • Includes Assessment
  • Earns a Badge
When working with big data there needs to be a mechanism to process and transform this data quickly and efficiently. Azure Databricks is a service that provides the latest version of Apache Spark that provides functionality processing data from Azure Storage. In this course, you will learn about the types of processing that can be performed with Azure Databricks such as stream, batch, image and parallel processing. Next, you'll learn how to create an Azure Databricks workspace using an Apache Spark cluster, run jobs in the Azure Databricks Workspace jobs using a service principal and query data in SQL server using an Azure Databricks notebook. Next, you'll learn how to retrieve data from an Azure Blob Storage using Azure Databricks and the Azure Key Vault, implement a Cosmos DB service endpoint for Azure Databricks, and extract, transform, and load data using Azure Databricks. Finally, you'll learn how to stream data into Azure Databricks by using Event Hubs and perform sentiment analysis for steam data by making use of Azure Databricks. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.

WHAT YOU WILL LEARN

  • discover the key concepts covered in this course
    describe the types of available processing when using Azure Databricks such as stream, batch, image and parallel processing
    create an Azure Databricks workspace using an Apache Spark cluster
    run jobs in the Azure Databricks Workspace jobs using a service principal
    query data in SQL server using an Azure Databricks notebook
    validate and handle failed batch loads
  • implement a Cosmos DB service endpoint for Azure Databricks
    extract, transform, and load data using Azure Databricks
    perform sentiment analysis for steam data by making use of Azure Databricks
    debug Spark Jobs running on HDInsight
    summarize the key concepts covered in this course

IN THIS COURSE

  • Playable
    1. 
    Course Overview
    2m 7s
    UP NEXT
  • Playable
    2. 
    Azure Databricks Processing
    8m 9s
  • Locked
    3. 
    Creating an Azure Databricks Workspace
    5m 58s
  • Locked
    4. 
    Running Azure Databricks Workspace Jobs
    21m 8s
  • Locked
    5. 
    Querying SQL Server
    12m 25s
  • Locked
    6. 
    Failed Batch Loads
    4m 40s
  • Locked
    7. 
    Implementing Cosmos DB Endpoints
    17m 59s
  • Locked
    8. 
    Extracting, Transforming, and Loading Data
    10m 31s
  • Locked
    9. 
    Performing Sentiment Analysis
    17m 10s
  • Locked
    10. 
    Debugging Spark Job
    8m 11s
  • Locked
    11. 
    Course Summary
    54s

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform

Digital badges are yours to keep, forever.