Microsoft Certified: Azure Data Engineer Associate: DP-203: Data Engineering on Microsoft Azure

  • 18 Courses | 24h 8m 54s
 
Explore how to build and maintain secure and compliant data processing pipelines by using different tools and techniques as you prepare for the DP-203: Data Engineering on Microsoft Azure certification exam.

GETTING STARTED

Data Engineering on Microsoft Azure: Storage Accounts

  1. Course Overview | 1m 28s
  2. Azure Blob Storage | 7m 4s

COURSES INCLUDED

Data Engineering on Microsoft Azure: Storage Accounts
Microsoft Azure Blob Storage is an object storage service that holds a variety of file types in containers. In this course, you'll learn about the capabilities of Blob Storage and how to architect a deployment for optimal performance and scalability. Then, you'll explore the options for redundancy and how to recover from disasters. You'll discover where Azure Data Lake Storage Gen2, a set of capabilities built on Blob Storage, can be used for big data operations. You'll also learn how to plan for a data lake deployment, examine best practices, and explore how to deploy a Data Lake Storage Gen2 account on Azure. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 16m
Assessment and badge available
Data Engineering on Microsoft Azure: Designing Data Storage Structures
Planning the structure for data storage is integral to performance in big data operations. In this course, you'll learn about key considerations for data lakes and how to determine which file type and file format are the most appropriate for your use case. Then, you'll explore how to design table storage for efficient querying and how data pruning can remove unnecessary data to accelerate transactions. You'll examine folder structures and data lake zones for organizing data effectively. Finally, you'll learn how to define storage tiers and how to manage the life cycle of data. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 8m
Assessment and badge available
Data Engineering on Microsoft Azure: Data Partitioning
Partitioning data is key to ensuring efficient processing. In this course, you'll explore what data partitioning is and the strategies for implementation. You'll learn about transactional and analytical workloads and how to determine the best strategy for your files and table storage. Then, you'll examine design patterns for efficiency and performance. You'll learn about partitioning dedicated SQL pools in Azure Synapse Analytics and partitioning data lakes. Finally, you'll learn how data sharding across multiple data stores can be used for improving transaction performance. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 7m
Assessment and badge available
Data Engineering on Microsoft Azure: Designing the Serving Layer
The serving layer is where data is stored for consumption by processing services. In this course, you'll explore dimensional data modeling and hierarchies. You'll learn how to define slowly changing dimensions and temporal design within databases. Then, you'll learn about the differences between the star and snowflake schemas as well as how to design a star schema. Next, you'll examine incremental data loading for stream processing and the options for analytical data stores. Finally, you'll learn about options for creating metastores for use by Azure Databricks and Azure Synapse Analytics. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 16m
Assessment and badge available
Data Engineering on Microsoft Azure: Physical Data Storage Structures
An effective storage structure is critical to the success of a big data implementation. In this course, you'll explore data compression in databases and file storage. Then, you'll discover how partitioning and sharding are implemented in databases. Next, you'll explore designing tables in an Azure Synapse Analytics dedicated SQL pool and implement geo-replication for redundancy in both databases and Azure Blob Storage. You'll also discover how to implement distribution schemes in Azure Synapse Analytics. Finally, you'll discover data archiving and long-term retention policies for Azure Blob Storage and Azure SQL Database. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 9m
Assessment and badge available
Data Engineering on Microsoft Azure: Logical Data Structures
Logical data structures, also called entity-relationship models, define a high-level view of data and the relationships within it. In this course, you'll learn about the stages of data lake maturity. You'll explore temporal database tables and how to manage them. You'll also learn how to define slowly changing dimensions and how to implement them. You'll then move on to explore logical file and folder structures for data ingestion. You'll discover how PolyBase can be used to connect to external tables. Finally, you'll explore the best practices for accelerating queries. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 36m
Assessment and badge available
Data Engineering on Microsoft Azure: The Serving Layer
Implementing an effective serving layer requires careful consideration of design, methods, and tools. In this course, you'll learn how traditional relational models can be replaced by the star schema and how to design a star schema. Then, you'll explore the purpose and structure of Parquet files used by Azure Databricks. You'll learn how to design and query a dimensional hierarchy. You'll move on to examine Azure Synapse Analytics, including deploying dedicated SQL pools and Apache Spark clusters. Finally, you'll learn how to create shared metadata tables between Spark clusters. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 8m
Assessment and badge available
Data Engineering on Microsoft Azure: Data Policies & Standards
Data policies and standards help to ensure a repeatable security standard is maintained. In this course, you'll learn about data encryption scenarios and best practices. You'll explore how Transparent Data Encryption (TDE) and Always Encrypted can be used to ensure data at rest is protected. Next, you'll examine how data classification and data masking can protect data being viewed. You'll learn to configure data retention and purging to ensure data is retained or removed as required. You'll also explore the various means of controlling access to Azure Data Lake Storage Gen2. Finally, you'll learn how to plan a data auditing strategy and how to limit access to data at the row level in a database. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 7m
Assessment and badge available
Data Engineering on Microsoft Azure: Securing Data Access
Securing access to data is a fundamental part of any security strategy. In this course, you'll explore how Azure Key Vault can be used to store and manage keys and secrets for accessing data. You'll discover how to connect to Azure resources through private endpoints, service endpoints, and managed virtual networks, and how to use Azure managed identities for connections between Azure resources. Next, you'll learn how to use access control lists and Azure role-based access control to grant users only the permissions they need to access your data. You'll also learn how token-based authentication works in Azure Databricks. Finally, you'll examine how to audit an Azure SQL Database to monitor for unauthorized access. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
12 videos | 1h 11m
Assessment and badge available
Data Engineering on Microsoft Azure: Securing Data
The final line of defense for protecting against a data breach is securing the data itself. With today's cloud environments, data is often in transit, duplicated, and stored in various data centers around the world, making effective data protection a challenge. In this course, you'll explore the various methods available for encrypting data stored in SQL databases. You'll examine how to use DataFrames in Databricks, as well as how to implement Advanced Threat Protection and dynamic data masking in Azure databases. Finally, you'll learn how immutable blobs can be used to manage sensitive information. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
11 videos | 1h 25m
Assessment and badge available