Data Refinery with MapReduce
Apache Hadoop 2.0
| Intermediate
- 13 Videos | 54m 55s
- Earns a Badge
MapReduce is a set of classes, which abstract away the complexity of parallel processing. Learn how MapReduce can take a single compute job and run it in our super computing platform.
WHAT YOU WILL LEARN
-
define the principle concepts of key-value pairs and list the rules for key-value pairsdescribe how MapReduce transforms key-value pairsload a large text book and then run WordCount to count the number of words in the text booklabel all of the functions for MapReduce on a diagrammatch the phases of MapReduce to their definitionsset up the classpath and test WordCountbuild a JAR file and run WordCount
-
describe the base mapper class of the MapReduce Java API and describe how to override its methodsdescribe the base Reducer class of the MapReduce Java API and describe how to override its methodsdescribe the function of the MapReduceDriver Java classset up the classpath and test a MapReduce jobidentify the concept of streaming for MapReducestream a Python job
IN THIS COURSE
-
1.Key-Value Pairs4m 22sUP NEXT
-
2.MapReduce and Key-Value Pairs2m 15s
-
3.WordCount, the Hello World of Hadoop2m 49s
-
4.MapReduce9m 30s
-
5.MapReduce Step-by-Step5m 26s
-
6.Exploring Hadoop Classpath2m 50s
-
7.Writing a MapReduce Job5m 45s
-
8.The Mapper Java API2m 51s
-
9.The Reducer Java API2m 35s
-
10.The Driver Java API3m 2s
-
11.Writing a MapReduce Job for Inventory5m 16s
-
12.Hadoop Streaming3m 45s
-
13.Running a Streaming Job4m 31s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform
Digital badges are yours to keep, forever.