Apache Spark SQL
Apache Spark 2.2
| Intermediate
- 16 Videos | 1h 2s
- Includes Assessment
- Earns a Badge
Apache Spark SQL is used for structured data processing in Spark. Explore features of Spark SQL such as SparkSessions, DataFrames, and Datasets.
WHAT YOU WILL LEARN
-
describe Apache Spark SQLcreate a SparkSessioncreate DataFrames with Spark SQLuse aggregations with the built-in DataFrames functionsrun SQL queries programmaticallycreate a global temporary viewcreate Datasets with Spark SQLuse JSON Datasets with Spark SQL
-
use Load/Save functionsmanually specify a data sourcerun SQL directly on filesuse SaveMode to handle save operationswrite parquet files with Spark SQLuse Spark SQL to save a DataFrame as a persistent tableuse partitioning when saving persistent tablesuse Spark SQL to create Datasets and DataFrames
IN THIS COURSE
-
1.Apache Spark SQL Overview2m 56sUP NEXT
-
2.SparkSession3m 37s
-
3.DataFrames4m 44s
-
4.Aggregations4m 9s
-
5.SQL Queries5m 10s
-
6.Temporary View3m 41s
-
7.Datasets2m 58s
-
8.JSON Datasets4m 50s
-
9.Load/Save Functions4m
-
10.Specifying a Data Source4m 23s
-
11.Querying with SQL3m 41s
-
12.SaveMode3m 26s
-
13.Parquet Files4m 5s
-
14.Persistent Tables2m 37s
-
15.Partitioning3m 4s
-
16.Exercise: Use Spark SQL2m 44s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform
Digital badges are yours to keep, forever.