This project explores topic modeling and unsupervised machine learning by analyzing e-commerce clothing reviews.
In this project, I want to practice Natural Language Processing (NLP) and Unsupervised Machine Learning.
After some research on what datasets I could obtain from the web, I found a women's clothing dataset from a real e-commerce business. I thought it would be useful to a business if I could automate the extraction of insights from its clothing reviews, because reading thousands of reviews is difficult and time-consuming.
For the complete write-up of this project, please see my post on Medium.
The problem I am trying to solve is applying Natural Language Processing (NLP) to reviews for topic modeling and for “separating” good and bad reviews using clustering.
The secondary goal is to train a supervised learning model that uses the Rating column from the dataset to classify reviews as good or bad.
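A rough sketch of the supervised setup is shown here, again with scikit-learn. The example rows, the 1-to-5 rating scale, the rule that a rating of 4 or higher counts as "good", and the TF-IDF plus logistic regression model are all assumptions made for illustration. The notebooks may use different features and classifiers.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Made-up (review text, rating) pairs standing in for the real dataset.
rows = [
    ("Love this dress, the fit is perfect", 5),
    ("Beautiful top, very flattering", 5),
    ("Great jeans, comfortable and soft", 4),
    ("Nice color and good quality", 4),
    ("The sweater is itchy and feels cheap", 2),
    ("Poor quality, it shrank after one wash", 1),
    ("Runs small and the zipper broke", 2),
    ("Disappointed, the fabric is thin", 1),
]
texts = [text for text, _ in rows]

# Assumption: ratings of 4 or higher are "good" (1), everything else is "bad" (0).
labels = [1 if rating >= 4 else 0 for _, rating in rows]

# TF-IDF features feeding a logistic regression classifier.
model = make_pipeline(TfidfVectorizer(stop_words="english"), LogisticRegression())
model.fit(texts, labels)

# Predict on unseen text; with this little training data the output is not reliable.
predictions = model.predict(["perfect fit, love it", "cheap fabric, poor quality"])
print(predictions)
```

On the real data, a held-out test split would be needed to judge how well the classifier generalizes.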
notebooks
- My code for exploratory data analysis, topic modeling, clustering, and classification models is in this folder.
presentation
- This is the deck that I presented at Metis. For references, please see the reference page in the deck.