AWS Certified Machine Learning: Feature Engineering Techniques

Amazon Web Services 2021    |    Intermediate
  • 13 Videos | 28m 15s
  • Includes Assessment
  • Earns a Badge
Raw data is rarely ready for building effective machine learning (ML) models. It often needs to be processed with feature engineering techniques before it can support accurate, well-optimized models. Take this course to learn techniques that make data compatible with ML models and improve their performance. Investigate techniques that improve data usability, such as one-hot encoding, binning, transformations, scaling, and shuffling. You will also learn about the importance and use of text feature engineering and the major workflows in the AWS environment. After completing this course, you'll be able to implement feature engineering techniques using AWS workflows, further preparing you for the AWS Certified Machine Learning – Specialty certification exam.
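
As a rough illustration of the techniques named in the description (not material taken from the course), the sketch below shows one-hot encoding, binning, min-max scaling, and shuffling in plain Python. All function names, bin edges, and labels are hypothetical and chosen only for the example; a real workflow would more likely use a library such as pandas or scikit-learn.

```python
import random


def one_hot(values):
    """One-hot encoding: one 0/1 indicator column per distinct category."""
    categories = sorted(set(values))  # fixed column order
    return [[1 if v == c else 0 for c in categories] for v in values]


def bin_values(values, edges, labels):
    """Binning: replace each number with the label of its interval.

    `edges` must be sorted ascending and `labels` needs len(edges) + 1 items.
    """
    binned = []
    for v in values:
        idx = sum(v >= e for e in edges)  # number of edges at or below v
        binned.append(labels[idx])
    return binned


def min_max_scale(values):
    """Scaling: map numbers linearly onto the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant column, avoid dividing by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def shuffle_rows(rows, seed=0):
    """Shuffling: return a reordered copy so row order carries no signal."""
    rng = random.Random(seed)  # seeded for reproducibility
    shuffled = list(rows)
    rng.shuffle(shuffled)
    return shuffled


# Hypothetical sample data
colors = ["red", "green", "red"]
ages = [5, 30, 70]

encoded = one_hot(colors)
age_groups = bin_values(ages, edges=[18, 65], labels=["child", "adult", "senior"])
scaled_ages = min_max_scale(ages)
mixed = shuffle_rows(list(zip(colors, ages)))
```

Sorting the categories in `one_hot` keeps the column order stable between runs, which matters if the same encoding has to be applied to training and test data.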

WHAT YOU WILL LEARN

  • discover the key concepts covered in this course
  • describe how to perform one-hot encoding and its main purpose
  • define binning and discretization as the process of transforming numerical variables into categorical counterparts
  • outline how data transformation can be used to make data more useful for data analysis
  • define data scaling and normalization and describe why it is important to standardize independent variables
  • outline data shuffling and define its role in removing biases and building more robust training models
  • work with commonly used feature engineering techniques on real data
  • recognize the basic principles behind text feature engineering
  • describe the process of term frequency-inverse document frequency (TF-IDF) and its uses in text mining
  • describe the bag-of-words model and compare it to TF-IDF
  • describe the concept of n-grams and why they are used in machine learning
  • use Spark and EMR workflows to prepare data for a TF-IDF problem
  • summarize the key concepts covered in this course
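
To make the text feature engineering objectives above more concrete, here is a minimal sketch in plain Python of n-grams, a bag-of-words count, and one common unsmoothed TF-IDF variant (term frequency times log of inverse document frequency). It is an assumption-laden illustration, not the course's method: the function names are hypothetical, tokenization is a simple lowercase whitespace split, and libraries such as Spark MLlib or scikit-learn use slightly different weighting formulas.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_words(docs):
    """Bag-of-words: raw term counts per document, word order ignored."""
    return [Counter(doc.lower().split()) for doc in docs]


def tf_idf(docs):
    """TF-IDF: term frequency scaled down for terms found in many documents.

    Uses tf = count / document length and idf = log(N / document frequency).
    """
    counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears
    doc_freq = Counter(term for c in counts for term in c)
    scores = []
    for c in counts:
        total = sum(c.values())
        scores.append({
            term: (k / total) * math.log(n_docs / doc_freq[term])
            for term, k in c.items()
        })
    return scores


# Hypothetical two-document corpus
docs = ["the cat sat", "the dog sat down"]
counts = bag_of_words(docs)
weights = tf_idf(docs)
bigrams = ngrams(docs[0].split(), 2)
```

In this variant a term that occurs in every document gets a weight of zero, which is the main contrast with bag-of-words: raw counts treat a ubiquitous word like any other, while TF-IDF discounts it.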

IN THIS COURSE

  1. Course Overview (53s)
  2. Feature Engineering: One-hot Encoding (1m 13s)
  3. Feature Engineering: Binning (1m 2s)
  4. Feature Engineering: Data Transformations (1m 29s)
  5. Feature Engineering: Data Scaling and Normalization (1m 28s)
  6. Feature Engineering: Data Shuffling (1m 12s)
  7. Working with Feature Engineering Techniques (7m 6s)
  8. Text Feature Engineering (1m 29s)
  9. Text Mining: TF-IDF (1m 53s)
  10. Bag-of-Words Model vs. TF-IDF (1m 27s)
  11. What are N-Grams? (1m 55s)
  12. Using Spark and EMR Workflows for Data Preparation (6m 21s)
  13. Course Summary (46s)

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft offers you the opportunity to earn a digital badge upon successful completion of this course. You can share the badge on any social network or business platform.

Digital badges are yours to keep, forever.