Dataset consists of 3000 Amazon customer reviews, star ratings, date of review, variant and feedback of various amazon Alexa products like Alexa Echo, Echo dots.
- Discover insights into consumer reviews and perfrom sentiment analysis on the data.
- Split the data in Train and Test
- Train and Test the model in the data set
- Visualize data
- Get best accuracy
Open Google Colab https://colab.research.google.com/
- File
- Upload Notebook
- Run the Cells
Import the data set and visualize the data
- Data set
- Visualizing data points
- Histogram (bins=5)
Transforming the data
- Drop collumns
- Apply Dumies at Variation Collumn
- Add the encoded column again
- Concatenate them together
Improve the model
- Droping the variations and add review length as a feature
- Apply randomforest_classifier
Final Model represented by Confunsion Matrix