This project aims to extract product defects and opinions from customer reviews using text clustering and sentiment analysis. The customer review data used in the project are derived from the Amazon Customer Reviews Dataset, available here.
For further details, please see our paper. Please cite this paper if you use this project.
→ Working environment
Python 3+ (developed with 3.7)
→ Required libraries
Available in requirements.txt. Install with pip install -r requirements.txt.
You can clone the project by running the following command or download it as a zip file via GitHub.
git clone https://github.com/SevcanDogramaci/AmazonReviewProject.git
To run the system, use either of the following files:
→ product_category.ipynb Jupyter Notebook in the ./test folder
→ main.py file in the root directory
Two different datasets are provided for testing purposes in the ./data folder.
You can also download other datasets from here and create database files for them using the database.py file in the root directory.
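To give a rough idea of what creating a database file from a downloaded dataset can involve, here is a minimal sketch. It is not the code in database.py, which may work differently. It assumes the dataset is a tab-separated file with a header row, that the column names match the public Amazon Customer Reviews schema (review_id, product_id, product_title, star_rating, review_body), and that the target is a SQLite file. The function name tsv_to_sqlite and the table name reviews are hypothetical.

```python
import csv
import sqlite3

# Assumed subset of the Amazon Customer Reviews TSV columns.
COLUMNS = ["review_id", "product_id", "product_title", "star_rating", "review_body"]


def tsv_to_sqlite(tsv_path, db_path, table="reviews"):
    """Load the selected columns of a review TSV file into a SQLite table.

    Hypothetical helper for illustration; returns the number of rows inserted.
    """
    cols = ", ".join(COLUMNS)
    placeholders = ", ".join("?" for _ in COLUMNS)
    conn = sqlite3.connect(db_path)
    try:
        # Columns are declared without types, so values are stored as read (text).
        conn.execute(f"CREATE TABLE IF NOT EXISTS {table} ({cols})")
        with open(tsv_path, newline="", encoding="utf-8") as f:
            # QUOTE_NONE keeps quotation marks inside review text untouched.
            reader = csv.DictReader(f, delimiter="\t", quoting=csv.QUOTE_NONE)
            rows = [tuple(row.get(c) for c in COLUMNS) for row in reader]
        conn.executemany(
            f"INSERT INTO {table} ({cols}) VALUES ({placeholders})", rows
        )
        conn.commit()
        return len(rows)
    finally:
        conn.close()
```

For example, tsv_to_sqlite("./data/my_reviews.tsv", "./data/my_reviews.db") would write one row per review, where both file names are placeholders. For very large files, inserting in batches instead of building the full rows list would keep memory use lower.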