/lda_gibbs_sampling

a python implementation of latent dirichlet allocation(lda) using gibbs sampling algorithm

Primary LanguagePython

LDA (Latent Dirichlet Allocation)

This is a python implementation of LDA using gibbs sampling algorithm.

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 16 sentences about one piece on wikipedia.

res1

The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 5000 chinese sina social news.

res2

The following picture shows the top 10 words in the 30 topics (set K = 30) generated by this algorithm over 5000 chinese sina social news.

res3

Author