Data Filtering
Data Science
| Beginner
- 11 videos | 56m 52s
- Includes Assessment
- Earns a Badge
Once data is gathered for data science, it is often in an unstructured or raw format and must be filtered for content and validity. Explore examples of practical tools and techniques for data filtering.
WHAT YOU WILL LEARN
-
identify common filtering techniques and toolsextract date elements from common date formatsparse content types in HTTP headersuse csvcut to filter CSV datause sed to replace values in a text data streamdrop duplicate records from data
-
extract headers from a jpeg imageuse pdfgrep to extract data from searchable pdf filesdetect invalid or impossible data combinationsparse robots.txt from a web site to decide what should and shouldn't be crawled nor indexeddrop records from a CSV file based on date range
IN THIS COURSE
-
1.Data Filtering Techniques and Tools3m 24sUP NEXT
-
2.Processing Date Formats6m 7s
-
3.Filtering HTTP Headers5m 11s
-
4.Filtering CSV Data4m 52s
-
5.Replacing Values with sed6m 16s
-
6.Dropping Duplicate Data4m 44s
-
7.Working with JPEG Headers6m 49s
-
8.Filtering PDF Files4m 55s
-
9.Filtering for Invalid Data5m 51s
-
10.Exercise: Cull Old Data3m 21s
-
11.Parsing robots.txt5m 22s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.