Introduction to the Shell for Hadoop HDFS

Apache Hadoop 2.9    |    Beginner
  • 9 Videos | 56m 6s
  • Includes Assessment
  • Earns a Badge
In this Skillsoft Aspire course, learners discover how to set up a Hadoop cluster on the cloud and explore two bundled web apps: the YARN Cluster Manager app and the HDFS (Hadoop Distributed File System) NameNode UI. The 9-video course assumes a good understanding of what Hadoop is and of how HDFS enables parallel processing of big data by distributing large data sets across a cluster. Learners should also be comfortable running commands from the Linux shell, with some fluency in basic Linux file system commands. The course opens by exploring the two web applications packaged with Hadoop: the UI for the YARN cluster manager and the NameNode UI for HDFS. Learners then explore two shells that can be used to work with HDFS, the hadoop fs shell and the hdfs dfs shell. Next, they explore basic commands for navigating HDFS, discuss their similarities to Linux file system commands, and discuss distributed computing. In a closing exercise, learners practice identifying the web applications used to explore and monitor Hadoop.

WHAT YOU WILL LEARN

  • provision a Hadoop cluster on the cloud using the Google Cloud Platform's Dataproc service
  • identify the various GCP services used by Dataproc when provisioning a cluster
  • list the metrics available on the YARN Cluster Manager app and recognize how it can be useful to monitor job executions
  • recall the details and metrics of HDFS available on the NameNode web app and how it can be used to browse the file system
  • identify the tools of the Hadoop ecosystem which are packaged with Hadoop and recall how they can be accessed
  • configure HDFS using the hdfs-site.xml file and identify the properties which can be set from it
  • compare the hadoop fs and hdfs dfs shells and recognize their similarities to Linux shells
  • explore apps for Hadoop, configure HDFS, work with HDFS shells
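
As a rough illustration of how closely the HDFS shells mirror Linux file system commands, the sketch below translates a few familiar Linux commands into their hadoop fs / hdfs dfs counterparts. It is not taken from the course: the helper name to_hdfs_command, the mapping table, and the /user/demo paths are hypothetical, and the code only builds the command line without running it against a cluster. Options are passed through unchanged, so an option that exists on Linux is not guaranteed to exist on the HDFS side.

```python
import shlex

# Hypothetical lookup table: Linux command name -> HDFS shell flag.
# Only a handful of common commands are covered here.
LINUX_TO_HDFS = {
    "ls": "-ls",
    "mkdir": "-mkdir",
    "cat": "-cat",
    "rm": "-rm",
    "cp": "-cp",
    "mv": "-mv",
    "du": "-du",
}


def to_hdfs_command(linux_command, shell=("hdfs", "dfs")):
    """Build the argument list for the HDFS shell equivalent of a Linux command.

    `shell` selects the front end: ("hdfs", "dfs") or ("hadoop", "fs").
    The remaining arguments are copied as-is (an assumption of this sketch).
    """
    parts = shlex.split(linux_command)
    if not parts:
        raise ValueError("empty command")
    name, *args = parts
    if name not in LINUX_TO_HDFS:
        raise ValueError(f"no mapping defined for {name!r}")
    return [*shell, LINUX_TO_HDFS[name], *args]


# Example paths are placeholders, not paths from the course.
print(" ".join(to_hdfs_command("ls /user/demo")))
print(" ".join(to_hdfs_command("mkdir -p /user/demo/data", shell=("hadoop", "fs"))))
```

The returned list could be handed to subprocess.run on a machine where the Hadoop client is installed; that step is left out here so the sketch runs without a cluster.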

IN THIS COURSE

  1. Course Overview | 2m 19s
  2. Creating a Hadoop Cluster on the Google Cloud | 9m 38s
  3. Exploring Hadoop Clusters | 3m 56s
  4. The YARN Cluster Manager UI | 9m 3s
  5. The HDFS NameNode UI | 7m 3s
  6. Browsing the Packaged Hadoop Tools | 4m 29s
  7. Configuring HDFS | 4m 48s
  8. The HDFS Shells | 5m 36s
  9. Exercise: Introduction to the HDFS Shell | 5m 47s

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft offers a digital badge upon successful completion of this course. The badge can be shared on any social network or business platform.

Digital badges are yours to keep, forever.
