Fundamentals of NLP: Preprocessing Text Using NLTK & SpaCy

Natural Language Processing | Intermediate

13 videos | 1h 56m 47s
Includes Assessment
Earns a Badge

Tokenization, stemming, and lemmatization are essential natural language processing (NLP) tasks. Tokenization involves breaking text into units (tokens), such as words or phrases, facilitating analysis. Stemming reduces words to a common base form by removing prefixes or suffixes, promoting simplicity in representation. In contrast, lemmatization considers grammatical aspects to transform words into their base or dictionary form. You will begin this course by tokenizing text using the Natural Language Toolkit (NLTK) and SpaCy, which involves splitting a large block of text into smaller units called tokens, usually words or sentences. You will then remove stopwords, common words such as "a" and "the" that add little meaning to text. Next, you'll explore the WordNet lexical database, which contains information about the semantic relationship between words. You'll use Synsets to view similar words and explore hypernyms, hyponyms, meronyms and holonyms. Finally, you'll compare stemming and lemmatization using NLTK and SpaCy. You will explore both processes with NLTK and perform lemmatization using SpaCy.

WHAT YOU WILL LEARN

Discover the key concepts covered in this course

Perform tokenization with nltk

Perform tokenization with spacy

Remove stopwords using nltk

Remove stopwords using spacy

Explore wordnet synsets

Compute similarity of words
Explore types of words in wordnet

Perform stemming with nltk

Perform lemmatization with nltk

Perform lemmatization with spacy

Perform parts-of-speech (pos) tagging and named entity recognition (ner)

Summarize the key concepts covered in this course

IN THIS COURSE

2m 16s

In this video, we will discover the key concepts covered in this course. FREE ACCESS
12m 49s

Find out how to perform tokenization with NLTK. FREE ACCESS
3. Implementing Word and Sentence Tokenization Using SpaCy

12m 56s

In this video, you will learn how to perform tokenization with SpaCy. FREE ACCESS
4. Performing Stop Word Removal Using NLTK

10m 45s

During this video, you will discover how to remove stopwords using NLTK. FREE ACCESS
5. Performing Stopword Removal Using SpaCy

5m 37s

Learn how to remove stopwords using SpaCy. FREE ACCESS
6. Understanding WordNet Synsets

5m 55s

In this video, we will explore WordNet synsets. FREE ACCESS
7. Computing Word Similarity Using WordNet

10m 18s

Discover how to compute similarity of words. FREE ACCESS
8. Understanding Hypernyms, Hyponyms, Antonyms, Meronyms, and Holonyms

12m 26s

In this video, find out how to explore types of words in WordNet. FREE ACCESS
9. Performing Stemming Using NLTK

9m 51s

Learn how to perform stemming with NLTK. FREE ACCESS
10. Performing Lemmatization Using NLTK

8m 21s

In this video, you will discover how to perform lemmatization with NLTK. FREE ACCESS
11. Performing Lemmatization Using SpaCy

12m 18s

Find out how to perform lemmatization with SpaCy. FREE ACCESS
12. Performing Parts of Speech Tagging and Named Entity Recognition

9m 53s

Discover how to perform parts-of-speech (POS) tagging and named entity recognition (NER). FREE ACCESS
13. Course Summary

3m 23s

In this video, we will summarize the key concepts covered in this course. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

Course Text Mining and Analytics: Pattern Matching & Information Extraction

(17)

Course Prompt Engineering for Data: Basic Data Manipulation Using Generative AI

Course SSCP 2021: Understanding & Applying Cryptography

(20)

Get Started

Sharpen your skills. Upgrade your career. Find the right learning path for you, based on your role and skills. Take part in hands-on practice, study for a certification, and much more - all personalized for you.

*Not included: Compliance, Leadership Development Program content, and Engineering books

Your content + our content + our platform = a path to learning success

Using our learning experience platform, Percipio, your learners can engage in custom learning paths that can feature curated content from all sources.

Learn More

Aspire to something bigger

Aspire Journeys are guided learning paths that set you in motion for career success.

Browse Aspire Journeys

Explore a world of live learning with Global Knowledge

Choose from convenient delivery formats to get the training you and your team need - where, when and how you want it.

Browse Live Learning

IT Skills & Salary Report

ESG Impact Report

Fundamentals of NLP: Preprocessing Text Using NLTK & SpaCy

WHAT YOU WILL LEARN

IN THIS COURSE

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

YOU MIGHT ALSO LIKE