Optimizing Query Executions with Hive

Apache Hive 2.3.2 | Intermediate

7 videos | 42m 9s
Includes Assessment
Earns a Badge

(15)

In this 7-video Skillsoft Aspire course, learners can explore optimizations allowing Apache Hive to handle parallel processing of data, while users can still contribute to improving query performance. For this course, learners should have previous experience with Hive and familiarity with querying big data for analysis purposes. The course focuses only on concepts; no queries are run. Learners begin to understand how to optimize query executions in Hive, beginning with exploring different options available in Hive to query data in an optimal manner. Discuss how to split data into smaller chunks, specifically, partitioning and bucketing, so that queries need not scan full data sets each time. Hive truly democratizes access to data stored in a Hadoop cluster, eliminating the need to know MapReduce to process cluster data, and makes data accessible using the Hive query language. All files in Hadoop are exposed in the form of tables. Watch demonstrations of structuring queries to reduce numbers of map reduce operations generated by Hive, and speeding up query executions. Other concepts covered include partitioning, bucketing, and joins.

WHAT YOU WILL LEARN

Recognize how hive translates queries to hadoop mapreduce operations

Identify the different options available in hive to optimize query execution

Recall how partitioning of a dataset can help queries run efficiently and identify the types of partitioning available in hive
Specify how bucketing improves query performance and compare it with partitioning a dataset

Identify how to join tables in hive to ensure the best performance of your query

Work with techniques to improve performance and work with partitioning, bucketing and structured queries

IN THIS COURSE

2m 18s

FREE ACCESS
4m 52s

After completing this video, you will be able to recognize how Hive translates queries into Hadoop MapReduce operations. FREE ACCESS
3. Techniques to Improve Query Performance in Hive

6m 50s

In this video, find out how to identify the different optimization options available in Hive. FREE ACCESS
4. Partitioning Tables in Hive

8m 36s

Upon completion of this video, you will be able to recall how partitioning of a dataset can help queries run efficiently and identify the types of partitioning available in Hive. FREE ACCESS
5. Bucketing Tables in Hive

7m 17s

After completing this video, you will be able to specify how bucketing improves query performance and compare it with partitioning a dataset. FREE ACCESS
6. Structuring Join Queries in Hive

4m 36s

In this video, you will learn how to join tables in Hive to ensure the best performance of your query. FREE ACCESS
7. Exercise: Optimizing Query Execution in Hive

7m 40s

In this video, learn how to work with techniques to improve performance, work with partitioning, bucketing, and structured queries. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

Book Scaling Google Cloud Platform: Run Workloads Across Compute, Serverless PaaS, Database, Distributed Computing, and SRE

Book Database Performance at Scale: A Practical Guide

Book Big Data and Hadoop: Fundamentals, Tools, and Techniques for Data-Driven Success, 2nd Edition

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Course Bucketing & Window Functions with Hive

(6)

Course Advanced Operations Using Hadoop MapReduce

(7)

Course Data Analysis Using the Spark DataFrame API

(31)

Get Started

Sharpen your skills. Upgrade your career. Find the right learning path for you, based on your role and skills. Take part in hands-on practice, study for a certification, and much more - all personalized for you.

*Not included: Compliance, Leadership Development Program content, and Engineering books

Your content + our content + our platform = a path to learning success

Using our learning experience platform, Percipio, your learners can engage in custom learning paths that can feature curated content from all sources.

Learn More

Aspire to something bigger

Aspire Journeys are guided learning paths that set you in motion for career success.

Browse Aspire Journeys

Explore a world of live learning with Global Knowledge

Choose from convenient delivery formats to get the training you and your team need - where, when and how you want it.

Browse Live Learning

IT Skills & Salary Report

ESG Impact Report

Optimizing Query Executions with Hive

WHAT YOU WILL LEARN

IN THIS COURSE

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

YOU MIGHT ALSO LIKE

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE