- Utilize Doc2Vec as a means to summarize raw text data as a method of dim. reduction
- Plot the data in a number of ways to visualize the document space
- Cluster the documents through various methods
- Clean the text as a means of improving clustering
- Develop labels for the clusters through topic modelling
- Implement new clustering algorithms