This project contains implementations of the following topics:
- Regular Expressions
- Introduction to NumPy
- Introduction to Pandas
- Introduction to NLTK
- Introduction to Stemming
- Introduction to Lemmatization
- Removing Stop Words
- Extracting Parts Of Speech Tags
- Named Entity Recognition
- Text Modelling using the Bag-of-Words (BoW) Model
- Text Modelling using the TF-IDF Model
- Building a Character-Based N-Gram Model
- Building a Word-Based N-Gram Model
- Understanding Latent Semantic Analysis (LSA)
- Finding Synonyms and Antonyms using NLTK
- Word Negation Tracking

Two presentation files are also included; they explain n-grams and LSA in more detail.
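
As a rough illustration of what the character-based and word-based n-gram topics cover, here is a minimal sketch in plain Python. It is not taken from this project's code: the function names `char_ngrams` and `word_ngrams` and the sample sentence are hypothetical, and splitting words on whitespace is a simplifying assumption.

```python
from collections import Counter


def char_ngrams(text, n):
    """Return every run of n consecutive characters in text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def word_ngrams(text, n):
    """Return every run of n consecutive whitespace-separated words."""
    words = text.split()  # assumption: words are separated by whitespace
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


# Hypothetical sample text, used only for illustration.
sample = "the cat sat on the mat"

# Frequency tables of character trigrams and word bigrams.
char_counts = Counter(char_ngrams(sample, 3))
word_counts = Counter(word_ngrams(sample, 2))
```

Counting n-grams this way is typically the first step before estimating the probabilities an n-gram model needs. If `n` is larger than the input, both functions return an empty list.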