Data Refinery with MapReduce
Apache Hadoop 2.0
| Intermediate
- 13 videos | 54m 55s
- Earns a Badge
MapReduce is a set of classes, which abstract away the complexity of parallel processing. Learn how MapReduce can take a single compute job and run it in our super computing platform.
WHAT YOU WILL LEARN
-
define the principle concepts of key-value pairs and list the rules for key-value pairsdescribe how MapReduce transforms key-value pairsload a large text book and then run WordCount to count the number of words in the text booklabel all of the functions for MapReduce on a diagrammatch the phases of MapReduce to their definitionsset up the classpath and test WordCountbuild a JAR file and run WordCount
-
describe the base mapper class of the MapReduce Java API and describe how to override its methodsdescribe the base Reducer class of the MapReduce Java API and describe how to override its methodsdescribe the function of the MapReduceDriver Java classset up the classpath and test a MapReduce jobidentify the concept of streaming for MapReducestream a Python job
IN THIS COURSE
-
4m 22s
-
2m 15s
-
2m 49s
-
9m 30s
-
5m 26s
-
2m 50s
-
5m 45s
-
2m 51s
-
2m 35s
-
3m 2s
-
5m 16s
-
3m 45s
-
4m 31s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.