Data Factory with Pig

Apache Hadoop 2.0
  • 12 Videos | 49m 45s
  • Earns a Badge
Likes 6 Likes 6
Pig is a data flow language for interfacing with Hadoop to extract, transform, and load data. Learn how to install & configure Pig, and use the command line to write and execute Pig scripts.

WHAT YOU WILL LEARN

  • describe Pig and its strengths
    recall the minimal edits needed to be made to the configuration file
    install and configure Pig
    recall the complex data types used by Pig
    recall some of the relational operators used by Pig
    use the grunt shell with PigLatin
  • set parameters from both a text file and with the command line
    write a Pig script
    use a Pig script to filter data
    use the ForEach operator with a Pig script
    set parameters and arguments in a Pig script
    write a Pig script to count data

IN THIS COURSE

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform

Digital badges are yours to keep, forever.

YOU MIGHT ALSO LIKE

Likes 22 Likes 22  
Likes 0 Likes 0  
COURSE Hadoop Ranger
Likes 4 Likes 4  

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Likes 259 Likes 259  
Likes 104 Likes 104  
Likes 4 Likes 4