The aim of this study is to explore different text mining techniques for extracting relevant information and hidden knowledge from this corpus.
Steps:
o Text categorization.
o Creation of a word cloud and other appropriate visualizations.
o Identifying the themes underlying the documents in a corpus.
o Dimensionality reduction.
o LSI: latent semantic indexing.
o LDA: latent Dirichlet allocation.
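
Most of the steps above start from the same preprocessing: tokenizing each document and counting terms. The following is a minimal sketch of that stage using only the Python standard library. The toy corpus, the stop-word list, and the function names are assumptions made for illustration and do not come from the study. Corpus-wide counts are what a word cloud would typically display, and a count or TF-IDF weighted term-document matrix is the usual input to LSI and LDA.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus; the study's actual corpus is not shown here.
DOCS = [
    "Text mining extracts knowledge from text.",
    "Topic models such as LDA find themes in a corpus.",
    "LSI reduces the dimensionality of a term-document matrix.",
]

# Small illustrative stop-word list (an assumption, not a standard list).
STOP_WORDS = {"a", "as", "from", "in", "of", "such", "the"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def term_frequencies(docs):
    """Return one Counter of token counts per document."""
    return [Counter(tokenize(d)) for d in docs]


def tf_idf(doc_counts):
    """Weight each term count by log(N / df), where df is its document frequency."""
    n_docs = len(doc_counts)
    df = Counter()
    for counts in doc_counts:
        df.update(counts.keys())
    return [
        {term: tf * math.log(n_docs / df[term]) for term, tf in counts.items()}
        for counts in doc_counts
    ]


counts = term_frequencies(DOCS)
corpus_counts = sum(counts, Counter())  # corpus-wide frequencies, e.g. for a word cloud
weights = tf_idf(counts)                # rows of a weighted term-document matrix
print(corpus_counts.most_common(3))
```

In practice, a library implementation of tokenization, TF-IDF, LSI, and LDA would likely replace these hand-written helpers. The sketch only shows what data those steps consume.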