Includes the following steps and more - Text preprocessing (lower case, tokenization, lemmatization, stemming), Text modelling (n-grams, bag of words, TF-IDF, Naive Bayes classification, RF classification, VADER Sentiment analysis, LDA topic modelling, word embeddings)
advaitsave/Text-preprocessing-and-classification
IMDB Movie Reviews - Text preprocessing and classification. Includes BOW model, TF_IDF, VADER entiment analysis, Topic Modelling using Latent Dirichlet Allocation and Word Embeddings. (Python)
Jupyter Notebook