Word Cloud is a great way to visually represent which words have more prominence most in a corpus. We are developing a new version of Word Cloud powered by Machine Learning to achive better results.
- Release a new version of WC that use Machine Learning for choosing meanful words in the corpus.
- Add a function for white/black listing words
- Initial version for testing WordCloud library only
- Create a baseline version of WC2 that use TF-ID for choosing the words
- Add ML pre-trained model based on Bert to extract meanful correlations
- Refine the model
- Keywords suggestion (provide the model which are the important topics to pay attention to)
- Add stemming (base vartiation of a word)
- Word black list (manually provide which words are not to be listed)
- API interface
- First official release
- Docker implementation
- Admin GUI interface