Developing a Basic MapReduce Hadoop Application
Apache Hadoop 2.9
| Beginner
- 10 Videos | 1h 13m 19s
- Includes Assessment
- Earns a Badge
In this Skillsoft Aspire course, discover how to use Hadoop's MapReduce; provision a Hadoop cluster on the cloud; and build an application with MapReduce to calculate word frequencies in a text document. To start, create a Hadoop cluster on the Google Cloud Platform using its Cloud Dataproc service; then work with the YARN Cluster Manager and HDFS (Hadoop Distributed File System) NameNode web applications that come packaged with Hadoop. Use Maven to create a new Java project for the MapReduce application, and develop a mapper for word frequency application. Create a Reducer for the application that will collect Mapper output and calculate word frequencies in input text files, and identify configurations of MapReduce applications in the Driver program and the project's pom.xml file. Next, build the MapReduce word frequency application with Maven to produce a jar file and prepare for execution from the master node of the Hadoop cluster. Finally, run the application and examine outputs generated to get word frequencies in the input text document. The exercise involves developing a basic MapReduce application.
WHAT YOU WILL LEARN
-
create and configure a Hadoop cluster on the Google Cloud Platform using its Cloud Dataproc servicework with the YARN Cluster Manager and HDFS NameNode web applications that come packaged with Hadoopuse Maven to create a new Java project for the MapReduce applicationdevelop a Mapper for the word frequency application that includes the logic to parse one line of the input file and produce a collection of keys and values as outputcreate a Reducer for the application that will collect the Mapper output and calculate the word frequencies in the input text file
-
specify the configurations of the MapReduce applications in the Driver program and the project's pom.xml filebuild the MapReduce word frequency application using Maven to produce a jar file and then prepare for execution from the master node of the Hadoop clusterrun the application and examine the outputs generated to get the word frequencies in the input text documentidenfity the apps packaged with Hadoop and the purposes they serve and recall the classes/methods used in the Map and Reduce phases of a MapReduce application
IN THIS COURSE
-
1.Course Overview2m 50sUP NEXT
-
2.Provisioning a Hadoop Cluster on the Cloud9m 13s
-
3.Browsing the Hadoop Web Applications7m 46s
-
4.Creating a MapReduce project6m 43s
-
5.Coding the Map Phase7m 19s
-
6.Coding the Reduce Phase6m 24s
-
7.Defining the Driver Program8m 33s
-
8.Building the Application8m 23s
-
9.Executing the MapReduce Application8m 56s
-
10.Exercise: Developing a Basic MapReduce Application7m 12s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform
Digital badges are yours to keep, forever.