/Reuters-21578_LDA_Topic_Modelling

Reuters-21578 Corpus is a collection of documents consisting of news articles which appeared on Reuters newswire in 1987. The corpus is available in NLTK package in Python. Topic Modelling has been conducted on this Reuters-21578 corpus of news documents using Latent Dirichlet Allocation (LDA). The obtained topics have been visualized using proportional topics and words distributions, and also, topic word clouds.

Primary LanguagePython

Stargazers

No one’s star this repository yet.