Storage & MapReduce

Apache Hadoop 2.0
  • 11 Videos | 51m 33s
  • Includes Assessment
  • Earns a Badge
Likes 41 Likes 41
MapReduce is a framework for writing applications to process huge amounts of data. Let's look at Hadoop storage, MapReduce, and how to use MapReduce with associated development tools.

WHAT YOU WILL LEARN

  • Illustrate and describe the components of Hadoop. Know the different parts of Hadoop, and what purpose they serve.
    Understand and differentiate between the different types of data. Explain the difference between structured, unstructured, and semi-structured data
    Understand the cloud column family of databases, and how they are used by Hadoop to store data. Discuss four different types of cloud databases; column, key value, document and graph
    Understand the basics of the Hadoop Distributed file system. Know how the HDFS is architected and structured. Know how the HDFS compares with other popular file system models.
    Understand the DFS and learn basic HDFS navigation operations. Learn how to navigate the command line and look inside the HDFS.
    Learn how to perform file operations within the HDFS. Understand how to add and delete files, and how to list files and see their properties.
  • Understand the basic principles of MapReduce and general mapping issues. Be able to describe the two steps in Map Reduce, using Mappers and Reducers. Explain how  MapReduce is part of the Hadoop framework.
    Understand how Pig and Hive are used for Hadoop Map Reduce Jobs. Know the differences between Pig and Hive, and how each can be used for unstructured and structured data.
    Understand how Hadoop uses MapReduce. Be able to describe the MapReduce lifecycle. Know the role of a Job Client, Job Tracker, and Task Tracker. Know the workings of Map Tasks, and Reduce Tasks.
    Understand how Hadoop MapReduce handles and processes data. Explore the mapping and reducing steps in more detail. Know the vocabulary of the MapReduce dataflow process.
    Understand the process of Mapping and Reducing from a conceptual point of view. Also know how to programmatically start MapReduce processing with Mappers and Reducers.

IN THIS COURSE

  • Playable
    1. 
    Overview of Hadoop, Storage, MapReduce, Pig, and Hive
    3m 46s
    UP NEXT
  • Playable
    2. 
    Understanding Data
    7m 9s
  • Locked
    3. 
    Types of NoSQL Databases
    3m 43s
  • Locked
    4. 
    Introduction to the Hadoop Distributed File System
    3m 21s
  • Locked
    5. 
    Interacting with the HDFS
    3m 11s
  • Locked
    6. 
    File Operations within the HDFS
    3m 19s
  • Locked
    7. 
    The MapReduce Principles, Mappers, and Reducers
    7m 47s
  • Locked
    8. 
    Using MapReduce with Pig and Hive
    3m 24s
  • Locked
    9. 
    Introduction to the MapReduce Life Cycle
    4m 31s
  • Locked
    10. 
    Understanding the MapReduce Data Flow
    3m
  • Locked
    11. 
    Subdividing Data
    3m 22s

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform

Digital badges are yours to keep, forever.

YOU MIGHT ALSO LIKE

Likes 8 Likes 8  

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Likes 66 Likes 66  
Likes 29 Likes 29  
Likes 35 Likes 35