Getting Started with Hive

Apache Hive 2.3.2    |    Beginner
  • 10 Videos | 59m 47s
  • Includes Assessment
  • Earns a Badge
This 10-video Skillsoft Aspire course focuses solely on theory and involves no programming or query execution. Learners begin by examining what a data warehouse is and how it differs from a relational database. The distinction matters because Apache Hive is primarily a data warehouse, even though it offers a SQL-like interface for querying data. Hive facilitates work on very large data sets stored as files in the Hadoop Distributed File System (HDFS), and lets users operate on the data in these files in parallel by transforming Hive queries into MapReduce operations. Next, learners hear about the types of data and operations that data warehouses and relational databases handle, before moving on to the basic components of the Hadoop architecture. Finally, the course discusses the features of Hive that make it popular among data analysts. In the concluding exercise, learners recall the differences between online transaction processing (OLTP) and online analytical processing (OLAP) systems; identify Hadoop’s three major components; list the Hadoop offerings on three major cloud platforms (AWS, Microsoft Azure, and Google Cloud Platform); and list the benefits of Hive for data analysts.
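Although the course itself involves no code, the idea that Hive turns a query into MapReduce operations can be made concrete with a small sketch. The Python below is not Hive and does not use Hadoop; it is a toy, in-memory imitation of how an aggregation such as `SELECT region, SUM(amount) FROM sales GROUP BY region` could be split into a map step, a shuffle step, and a reduce step. The `sales` table, its columns, the sample rows, and all function names are hypothetical, chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical rows, standing in for lines of a file stored in HDFS.
ROWS = [
    {"region": "east", "amount": 10},
    {"region": "west", "amount": 5},
    {"region": "east", "amount": 7},
]


def map_phase(rows):
    """Emit one (key, value) pair per input row: (region, amount)."""
    for row in rows:
        yield row["region"], row["amount"]


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Aggregate the values collected for each key; here, a sum."""
    return {key: sum(values) for key, values in groups.items()}


def run_query(rows):
    # Conceptually: SELECT region, SUM(amount) FROM sales GROUP BY region
    return reduce_phase(shuffle(map_phase(rows)))


print(run_query(ROWS))  # {'east': 17, 'west': 5}
```

In a real cluster the map and reduce steps run in parallel across many machines, each working on a different part of the files; this sketch runs them sequentially only to show the shape of the computation.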

WHAT YOU WILL LEARN

  • define what a data warehouse is and identify its characteristics
  • describe the functions served by relational databases and the features they offer
  • distinguish between Online Transaction Processing and Online Analytical Processing and identify the specific problems they are meant to solve
  • identify where Hive fits in the Hadoop ecosystem and how it simplifies working with Hadoop
  • describe the architecture of Hive and the functions served by HiveServer and the Metastore
  • identify the services and features offered by AWS, Azure, and GCP to run Hadoop and Hive on their infrastructure
  • describe the different primitive and complex data types available in Hive
  • compare managed and external tables in Hive and how they relate to the underlying data
  • contrast OLTP and OLAP systems, identify major components of Hadoop, explore Hive benefits for data analysis

IN THIS COURSE

  1. Course Overview (2m 21s)
  2. Hive as a Data Warehouse (4m 54s)
  3. Overview of Relational Databases (4m 49s)
  4. OLTP and OLAP (7m 3s)
  5. Hive and the Hadoop Ecosystem (6m 51s)
  6. HiveServer and The Metastore (7m 38s)
  7. Hive on Cloud Computing Platforms (5m 40s)
  8. Data Types in Hive (6m 19s)
  9. Data and Tables in Hive (2m 46s)
  10. Exercise: Introduction to Hive (7m 26s)

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft offers you the opportunity to earn a digital badge upon successful completion of this course. The badge can be shared on any social network or business platform.

Digital badges are yours to keep, forever.
