Data Factory with Pig

Apache Hadoop 2.0    |    Intermediate
  • 12 videos | 47m 45s
  • Earns a Badge
Pig is a data flow platform for Hadoop, scripted in the Pig Latin language, that is used to extract, transform, and load data. Learn how to install and configure Pig, and how to use the command line to write and execute Pig scripts.

WHAT YOU WILL LEARN

  • describe Pig and its strengths
  • recall the minimal edits that must be made to the configuration file
  • install and configure Pig
  • recall the complex data types used by Pig
  • recall some of the relational operators used by Pig
  • use the Grunt shell with Pig Latin
  • set parameters from both a text file and the command line
  • write a Pig script
  • use a Pig script to filter data
  • use the FOREACH operator in a Pig script
  • set parameters and arguments in a Pig script
  • write a Pig script to count data

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft gives you the opportunity to earn a digital badge upon successful completion of some of our courses. Badges can be shared on any social network or business platform.

Digital badges are yours to keep, forever.
