Hadoop MapReduce Applications With Combiners
Apache Hadoop 2.9
| Intermediate
- 13 Videos | 1h 23m 5s
- Includes Assessment
- Earns a Badge
In this Skillsoft Aspire course, explore the use of Combiners to make MapReduce applications more efficient by minimizing data transfers. Start by learning about the need for Combiners to optimize the execution of a MapReduce application by minimizing data transfers within a cluster. Recall the steps to process data in a MapReduce application, and look at using a Combiner to perform partial reduction of data output from the Mapper. Then create a new project to calculate average automobile prices using Maven for a MapReduce application. Next, develop the Mapper and Reducer to calculate the average price for automobile makes in the input data set. Create a driver program for the MapReduce application, run it, and check output to get the average price per automobile. Learn how to code up a Combiner for a MapReduce application, fix the bug in the application so it can be used to correctly calculate the average price, then run the fixed application to verify that the prices are being calculated correctly. The concluding exercise concerns optimizing MapReduce with Combiners.
WHAT YOU WILL LEARN
-
recognize the need for combiners to optimize the execution of a MapReduce application by minimizing data transfers within a clusterrecall the steps involved in processing data in a MapReduce applicationdescribe the working of a Combiner in performing a partial reduction of the data that is output from the Mapperconfigure a Combiner to optimize a MapReduce application that calculates an average valueuse Maven to create a new project for a MapReduce application and plan out the Map and Reduce phases by examining the auto prices datasetdevelop the Mapper and Reducer for the application that will calculate the average price for each make of automobile in the input dataset
-
create the driver program for the MapReduce applicationrun the MapReduce application and check the output to get the average price for each automobile makecode up a Combiner for the MapReduce application and configure the Driver to use it for a partial reduction on the Mapper nodes of the clusterfix the bug in the previous application by defining a type that represents both the aggregate price and count of automobiles that can be used to correctly calculate the average pricecompare the output of the modified application with the previous buggy version and verify that the average prices for the vehicles are being calculated correctlyidentify the shortcomings of regular MapReduce operations which are addressed by Combiners, and how Combiners differ from Reducers
IN THIS COURSE
-
1.Course Overview2m 32sUP NEXT
-
2.Combiners in MapReduce5m 19s
-
3.Revisiting MapReduce5m 2s
-
4.Working with Combiners5m 47s
-
5.Using Combiners for Calculating Averages8m 22s
-
6.Creating a Project to Calculate Averages7m 1s
-
7.Coding the Map and Reduce Phases8m 3s
-
8.Configure the Application in the Driver3m 13s
-
9.Executing the Application and Examining the Output7m 54s
-
10.Adding a Combiner to a MapReduce Application9m 31s
-
11.Conveying a Pair of Numbers from the Mapper9m 21s
-
12.Running the Fixed Application5m 6s
-
13.Exercise: Optimizing MapReduce With Combiners5m 54s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform
Digital badges are yours to keep, forever.