Data Filtering

Data Science    |    Beginner
  • 11 videos | 56m 52s
  • Includes Assessment
  • Earns a Badge
Rating 4.3 of 84 users
Once data is gathered for data science, it is often in an unstructured or raw format and must be filtered for content and validity. Explore examples of practical tools and techniques for data filtering.

WHAT YOU WILL LEARN

  • Identify common filtering techniques and tools
  • Extract date elements from common date formats
  • Parse content types in HTTP headers
  • Use csvcut to filter CSV data
  • Use sed to replace values in a text data stream
  • Drop duplicate records from data
  • Extract headers from a JPEG image
  • Use pdfgrep to extract data from searchable PDF files
  • Detect invalid or impossible data combinations
  • Parse robots.txt from a website to decide what should and shouldn't be crawled or indexed
  • Drop records from a CSV file based on date range
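
As a rough illustration of three of the objectives above (dropping duplicate records, detecting impossible data, and dropping records outside a date range), here is a minimal Python sketch. It is not taken from the course: the function name `filter_rows`, the `date` column, the `YYYY-MM-DD` format, and the sample data are all assumptions made for the example.

```python
import csv
import io
from datetime import date, datetime


def filter_rows(csv_text, start, end, date_field="date"):
    """Return unique rows whose date_field lies within [start, end].

    Hypothetical helper: assumes a header row and YYYY-MM-DD dates.
    """
    seen = set()
    kept = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        try:
            day = datetime.strptime(row[date_field], "%Y-%m-%d").date()
        except (KeyError, TypeError, ValueError):
            continue  # missing or impossible date (e.g. February 30): drop
        if not (start <= day <= end):
            continue  # outside the requested date range: drop
        key = tuple(row.items())
        if key in seen:
            continue  # exact duplicate of an earlier record: drop
        seen.add(key)
        kept.append(row)
    return kept


# Made-up sample data for illustration only.
SAMPLE = """id,date,value
1,2021-01-15,10
1,2021-01-15,10
2,2021-02-30,7
3,2021-03-01,5
4,2020-12-31,9
"""

rows = filter_rows(SAMPLE, date(2021, 1, 1), date(2021, 12, 31))
print([r["id"] for r in rows])  # ['1', '3']
```

Parsing each date with `strptime` handles two checks at once: a value that cannot be a real calendar date raises `ValueError`, so it is rejected before the range comparison.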

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft gives you the opportunity to earn a digital badge upon successful completion of some of our courses. You can share the badge on any social network or business platform.

Digital badges are yours to keep, forever.
