Topic modelling and NLP with gensim and nltk
- Data cleaning
- Data pre-processing (tokenization, stop-word removal, lemmatization)
- Finding the optimal number of topics using coherence scores
- Generating topic-word and document-topic distribution matrices
- Finding similar products with cosine similarity
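
The last step in the list, finding similar products with cosine similarity, can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes each product is already represented by one row of the document-topic matrix, and the product names, the three-topic vectors, and the helper `most_similar` are all hypothetical. It uses plain Python so the arithmetic is visible; in practice a library routine would likely replace the hand-written function.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: an all-zero vector matches nothing.
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar(doc_topics, query, top_n=3):
    """Rank every other product by similarity to `query` (hypothetical helper)."""
    scores = [
        (name, cosine_similarity(doc_topics[query], vec))
        for name, vec in doc_topics.items()
        if name != query
    ]
    # Highest similarity first, keep at most top_n results.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_n]


# Hypothetical document-topic distributions: one row per product,
# one column per topic, each row summing to 1.
doc_topics = {
    "product_a": [0.80, 0.15, 0.05],
    "product_b": [0.75, 0.20, 0.05],
    "product_c": [0.05, 0.10, 0.85],
}

ranking = most_similar(doc_topics, "product_a")
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because the comparison is made on topic proportions rather than raw word counts, two products can rank as similar even when their descriptions share few exact words, as long as the model assigns them to the same topics.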