Course Content
-
Introduction
-
1. NLP Basics
- What are NLP and NLTK?
- NLTK setup and overview
- Reading in text data
- Exploring the dataset
- What are regular expressions?
- Learning how to use regular expressions
- Regular expression replacements
- Machine learning pipeline
- Implementation: Removing punctuation
- Implementation: Tokenization
- Implementation: Removing stop words
-
2. Supplemental Data Cleaning
-
3. Vectorizing Raw Data