NLP Demystified 3: Basic Preprocessing (case-folding, stop words, stemming, lemmatization)

Future Mojo
Future Mojo
10.7 هزار بار بازدید - 2 سال پیش - Course playlist:
Course playlist: Natural Language Processing Demystified

Depending on our goal, we may preprocess text further. We'll cover case-folding, stop word removal, stemming, and lemmatization. We'll go over their use cases, their tradeoffs, and how to get them done using spaCy.

Colab notebook: https://colab.research.google.com/git...

Timestamps:
00:00:00 Basic Preprocessing
00:00:35 Case-folding and its tradeoffs
00:02:40 Stop word removal (tradeoffs and how it can go wrong)
00:04:40 Stemming (tradeoffs and things to watch out for)
00:06:28 Lemmatization and its advantages over stemming
00:07:52 DEMO: basic processing with spaCy
00:10:37 Basic preprocessing recap

This video is part of Natural Language Processing Demystified --a free, accessible course on NLP.

Visit https://www.nlpdemystified.org/ to learn more.
2 سال پیش در تاریخ 1401/02/07 منتشر شده است.
10,740 بـار بازدید شده
... بیشتر