Data Factory with Pig
Apache Hadoop 2.0
| Intermediate
- 12 Videos | 47m 45s
- Earns a Badge
Pig is a data flow language for interfacing with Hadoop to extract, transform, and load data. Learn how to install & configure Pig, and use the command line to write and execute Pig scripts.
WHAT YOU WILL LEARN
-
describe Pig and its strengthsrecall the minimal edits needed to be made to the configuration fileinstall and configure Pigrecall the complex data types used by Pigrecall some of the relational operators used by Piguse the grunt shell with PigLatin
-
set parameters from both a text file and with the command linewrite a Pig scriptuse a Pig script to filter datause the ForEach operator with a Pig scriptset parameters and arguments in a Pig scriptwrite a Pig script to count data
IN THIS COURSE
-
1.Overviewing Pig7m 34sUP NEXT
-
2.Overview of Pig Configuration1m 17s
-
3.Installing Pig4m 9s
-
4.Pig Data Types2m 34s
-
5.Pig Operators1m 37s
-
6.Pig Command Line7m 2s
-
7.Pig Scripts2m 44s
-
8.First Pig Script3m 19s
-
9.Pig Filtering5m 48s
-
10.Pig ForEach4m 51s
-
11.Pig Parameters and Arguments3m 52s
-
12.Pig Functions3m
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform
Digital badges are yours to keep, forever.