Course details

Getting Started with Hive: Bucketing & Window Functions

Getting Started with Hive: Bucketing & Window Functions


Overview/Description
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description

Apache Hive is one of the most popular data warehouses out in the market used for data science. In this Skillsoft Aspire course, you will explore how Hive query executions can be optimized, including techniques like bucketing datasets. Using window functions to extract meaningful insights from data is also covered.



Expected Duration (hours)
1.1

Lesson Objectives

Getting Started with Hive: Bucketing & Window Functions

  • Course Overview
  • implement bucketing for a Hive table and explore the structure of the table and bucket on HDFS
  • apply both bucketing and partitioning for a table and describe the structure of such a table on HDFS
  • extract further performance from Hive queries by sorting the contents of buckets
  • work with samples of a Hive table by dividing it into buckets
  • perform join operations on three or more tables by chaining the joins
  • implement a window function to calculate running totals on an ordered dataset
  • apply a window function within a partition of your dataset
  • apply bucketing of Hive tables to boost query performance and to use window functions
  • Course Number:
    it_dsgshvdj_06_enus

    Expertise Level
    Intermediate