Data Analysis using Spark SQL

Apache Spark 2.3    |    Intermediate
  • 9 Videos | 57m 36s
  • Includes Assessment
  • Earns a Badge
Likes 66 Likes 66
Analyze an Apache Spark DataFrame as though it were a relational database table. During this Aspire course, you will discover the different stages involved in optimizing any query or method call on the contents of a Spark DataFrame. Discover how to create views out of a Spark DataFrame's contents and run queries against them; and how to trim and clean a DataFrame. Next, learn how to perform an analysis of data by running different SQL queries; how to configure a DataFrame with an explicitly defined schema; and define what a window is in the context of Spark. Finally, observe how to create and analyze categories of data in a data set by using Windows.

WHAT YOU WILL LEARN

  • recall the different stages involved in optimizing any query or method call on the contents of a Spark DataFrame
    create views out of a Spark DataFrame's contents and run queries against them
    trim and clean a DataFrame before a view is created as a precursor to running SQL queries on it
    perform an analysis of data by running different kinds of SQL queries, including grouping and aggregations
  • recognize how Spark DataFrames infer the schema of data loaded into them and configure a DataFrame with an explicitly defined schema
    define what a window is in the context of Spark DataFrames and when they can be used
    create and analyze categories of data in a dataset using Windows
    analyze data using Spark SQL

IN THIS COURSE

  • Playable
    1. 
    Course Overview
    2m 45s
    UP NEXT
  • Playable
    2. 
    The Spark Catalyst Optimizer
    3m
  • Locked
    3. 
    Introduction to Spark SQL
    7m 41s
  • Locked
    4. 
    Preparing Data for Analysis
    7m 14s
  • Locked
    5. 
    Running SQL Queries
    6m 44s
  • Locked
    6. 
    Inferred and Explicit Schemas
    7m 56s
  • Locked
    7. 
    Windowing in Spark
    5m 53s
  • Locked
    8. 
    Applying Window Functions
    8m 3s
  • Locked
    9. 
    Exercise: Data Analysis Using Spark SQL
    4m 50s

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform

Digital badges are yours to keep, forever.

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Likes 64 Likes 64  
Likes 121 Likes 121  
Likes 104 Likes 104