Data Engineering on Microsoft Azure: Designing Data Storage Structures

Azure 2021
  • 11 Videos | 1h 12m 38s
  • Includes Assessment
  • Earns a Badge
Planning the structure for data storage is integral to performance in big data operations. In this course, you'll learn about key considerations for data lakes and how to determine which file type and file format are the most appropriate for your use case. Then, you'll explore how to design table storage for efficient querying and how data pruning can skip unnecessary data to accelerate queries. You'll examine folder structures and data lake zones for organizing data effectively. Finally, you'll learn how to define storage tiers and how to manage the life cycle of data. This course is one of a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.

WHAT YOU WILL LEARN

  • discover the key concepts covered in this course
  • describe key considerations for designing a data lake
  • identify and evaluate criteria for selecting a file format for big data applications
  • recognize the defining characteristics of the supported file formats in Azure Data Lake
  • describe steps for efficient read operations for a table storage service
  • describe the dynamic data pruning feature in Databricks at the file and partition level
  • recognize an efficient folder structure design
  • define the zones within a data lake for organizing data distribution
  • describe the data access tiers in Azure Blob storage and how data can be moved between them for efficient and cost-effective storage
  • describe the steps to archive data in an Azure Blob storage container, rehydrate blob data, and automate access tiers using life cycle management
  • summarize the key concepts covered in this course

IN THIS COURSE

  1. Course Overview (1m 30s)
  2. Designing a Data Lake (7m 12s)
  3. Big Data File Type Planning (7m 17s)
  4. Big Data File Formats (8m 46s)
  5. Designing Table Storage for Querying (8m 31s)
  6. Dynamic Data Pruning (6m 25s)
  7. Designing a Folder Structure (7m 3s)
  8. Data Lake Zones (5m 51s)
  9. Storage Archiving Tier (6m 47s)
  10. Data Archiving, Rehydrating, and Life Cycle Management (7m 52s)
  11. Course Summary (54s)

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft gives you the opportunity to earn a digital badge upon successful completion of this course. You can share the badge on any social network or business platform.

Digital badges are yours to keep, forever.
