Getting Started with Hive: Introduction
Overview/Description
Apache Hive is one of the most widely used data warehouse systems for data science. It processes big data in parallel across a cluster using a simple, intuitive SQL-like query language. In this Skillsoft Aspire course, you will discover the fundamental concepts of Hive.
Expected Duration (hours)
0.9
Lesson Objectives
Course Overview
define what a data warehouse is and identify its characteristics
describe the functions served by relational databases and the features they offer
distinguish between Online Transaction Processing and Online Analytical Processing and identify the specific problems they are meant to solve
identify where Hive fits in the Hadoop ecosystem and how it simplifies working with Hadoop
describe the architecture of Hive and the functions served by HiveServer and the Metastore
identify the services and features offered by AWS, Azure, and GCP to run Hadoop and Hive on their infrastructure
describe the different primitive and complex data types available in Hive
compare managed and external tables in Hive and how they relate to the underlying data
contrast OLTP and OLAP systems, identify the major components of Hadoop, and explore the benefits of Hive for data analysis
Course Number
it_dsgshvdj_01_enus
Expertise Level
Beginner