Data Filtering

Data Science    |    Beginner
  • 11 videos | 56m 52s
  • Includes Assessment
  • Earns a Badge
Rating 4.3 (84 users)
Once data is gathered for data science, it is often in an unstructured or raw format and must be filtered for content and validity. Explore examples of practical tools and techniques for data filtering.

WHAT YOU WILL LEARN

  • Identify common filtering techniques and tools
  • Extract date elements from common date formats
  • Parse content types in HTTP headers
  • Use csvcut to filter CSV data
  • Use sed to replace values in a text data stream
  • Drop duplicate records from data
  • Extract headers from a JPEG image
  • Use pdfgrep to extract data from searchable PDF files
  • Detect invalid or impossible data combinations
  • Parse robots.txt from a website to decide what should and shouldn't be crawled or indexed
  • Drop records from a CSV file based on date range
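
To give a flavor of three of the objectives above (dropping duplicate records, detecting impossible values, and dropping records by date range), here is a minimal Python sketch using only the standard library. It is not taken from the course material: the function name `filter_records`, the column names, and the sample data are all hypothetical, and it assumes dates are stored in ISO format (YYYY-MM-DD).

```python
import csv
import io
from datetime import date

def filter_records(csv_text, start, end, date_field="date"):
    """Keep unique rows whose date is valid and falls within [start, end].

    Hypothetical helper for illustration; assumes ISO-formatted dates.
    """
    seen = set()
    kept = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        key = tuple(row.items())
        if key in seen:
            continue  # exact duplicate of an earlier row
        seen.add(key)
        try:
            d = date.fromisoformat(row[date_field])
        except (TypeError, ValueError):
            continue  # missing or impossible date, e.g. 2021-02-30
        if start <= d <= end:
            kept.append(row)
    return kept

# Made-up sample data: one duplicate, one impossible date, one out-of-range date.
SAMPLE = """id,date,value
1,2021-01-15,10
1,2021-01-15,10
2,2021-02-30,7
3,2021-03-01,4
4,2020-12-31,9
"""

rows = filter_records(SAMPLE, date(2021, 1, 1), date(2021, 12, 31))
print([r["id"] for r in rows])
```

With this sample, only the records with id 1 and 3 survive. Command-line tools named in the list, such as csvcut and sed, cover similar ground from the shell.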

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of some of our courses. Badges can be shared on any social network or business platform.

Digital badges are yours to keep, forever.
