Text-preprocessing-and-classification

IMDB Movie Reviews: text preprocessing and classification. Includes a bag-of-words (BOW) model, TF-IDF, VADER sentiment analysis, topic modelling using Latent Dirichlet Allocation, and word embeddings. (Python)

Primary language: Jupyter Notebook

An introductory guide with steps to solve NLP problems

Covers the following steps and more: text preprocessing (lowercasing, tokenization, lemmatization, stemming) and text modelling (n-grams, bag of words, TF-IDF, Naive Bayes classification, Random Forest (RF) classification, VADER sentiment analysis, LDA topic modelling, word embeddings).
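
As a rough illustration of a few of the steps listed above (lowercasing, tokenization, n-grams, bag of words, TF-IDF), here is a minimal sketch using only the Python standard library. It is not the notebook's code: the function names, the two sample reviews, and the unsmoothed `tf * log(N / df)` weighting are all assumptions made for the example. Libraries typically use smoothed TF-IDF variants, so their numbers will differ.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lower-case the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def ngrams(tokens, n):
    """Return all n-grams (as tuples) over a list of tokens."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_words(tokenized_docs):
    """Represent each tokenized document as a Counter of term counts."""
    return [Counter(doc) for doc in tokenized_docs]


def tf_idf(bows):
    """Weight each term by tf * log(N / df) (one common, unsmoothed variant)."""
    n_docs = len(bows)
    # Iterating over a Counter yields each distinct term once per document,
    # so this counts the number of documents that contain each term.
    df = Counter(term for bow in bows for term in bow)
    weights = []
    for bow in bows:
        total = sum(bow.values())
        weights.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in bow.items()}
        )
    return weights


# Hypothetical sample reviews, made up for illustration only.
reviews = [
    "A great movie with a great cast",
    "A dull movie",
]

tokens = [tokenize(r) for r in reviews]
bigrams = ngrams(tokens[1], 2)
bows = bag_of_words(tokens)
weights = tf_idf(bows)
```

In this sketch, a term that occurs in every document (such as "movie" here) gets a weight of zero, while terms specific to one review get positive weights. That is the intuition behind using TF-IDF features rather than raw counts as classifier input.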