Apache Spark SQL

Apache Spark 2.2 | Intermediate
  • 16 Videos | 1h 7m 32s
  • Includes Assessment
  • Earns a Badge
Apache Spark SQL is the Spark module for structured data processing. Explore Spark SQL features such as SparkSession, DataFrames, and Datasets.

WHAT YOU WILL LEARN

  • describe Apache Spark SQL
  • create a SparkSession
  • create DataFrames with Spark SQL
  • use aggregations with the built-in DataFrame functions
  • run SQL queries programmatically
  • create a global temporary view
  • create Datasets with Spark SQL
  • use JSON Datasets with Spark SQL
  • use Load/Save functions
  • manually specify a data source
  • run SQL directly on files
  • use SaveMode to handle save operations
  • write Parquet files with Spark SQL
  • use Spark SQL to save a DataFrame as a persistent table
  • use partitioning when saving persistent tables
  • use Spark SQL to create Datasets and DataFrames

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft offers you the opportunity to earn a digital badge upon successful completion of this course. You can share the badge on any social network or business platform.

Digital badges are yours to keep, forever.
