Previous Page

Getting Started with Hive: Viewing and Querying Complex Data

Getting Started with Hive: Viewing and Querying Complex Data


Overview/Description
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description

Apache Hive is one of the most popular data warehouses out in the market used for data science. Hive simplifies working with large datasets in files by representing them as tables and allowing them to be queried with a simple and intuitive query language. In this Skillsoft Aspire course, you will explore working with complex data types in Hive.



Expected Duration (hours)
1.2

Lesson Objectives

Getting Started with Hive: Viewing and Querying Complex Data

  • Course Overview
  • load and access data in the form of arrays
  • work with data in the form of key-value pairs - map data structures in Hive
  • define and use structured data in the form of Hive struct types
  • transform complex data types to a tabular format to facilitate analysis using the explode and posexplode functions
  • combine the results of the explode function with other columns of a table to generate a lateral view
  • flatten multi-dimensional data structures by chaining lateral views
  • use the UNION and UNION ALL operations on table data and distinguish between the two
  • search for values in the results of a subquery using the IN and EXIST clauses
  • create and load data into tables efficiently by including these operations in a single query
  • define and work with views in Hive to simplify querying and control access to data
  • perform queries and utilize views on complex data types available in Hive
  • Course Number:
    it_dsgshvdj_03_enus

    Expertise Level
    Beginner