Data Factory with Pig
Apache Hadoop 2.0
| Intermediate
- 12 videos | 47m 45s
- Earns a Badge
Pig is a data flow language for interfacing with Hadoop to extract, transform, and load data. Learn how to install & configure Pig, and use the command line to write and execute Pig scripts.
WHAT YOU WILL LEARN
-
describe Pig and its strengthsrecall the minimal edits needed to be made to the configuration fileinstall and configure Pigrecall the complex data types used by Pigrecall some of the relational operators used by Piguse the grunt shell with PigLatin
-
set parameters from both a text file and with the command linewrite a Pig scriptuse a Pig script to filter datause the ForEach operator with a Pig scriptset parameters and arguments in a Pig scriptwrite a Pig script to count data
IN THIS COURSE
-
7m 34s
-
1m 17s
-
4m 9s
-
2m 34s
-
1m 37s
-
7m 2s
-
2m 44s
-
3m 19s
-
5m 48s
-
4m 51s
-
3m 52s
-
3m
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.