Dataset Link : https://s3-ap-southeast-1.amazonaws.com/he-public-data/dataset52a7b21.zip
Classify 3 million products into around 10,000 categories.
- Sandeep Rajakrishnan
- Sudhay Senthilkumar
- Product Name
- Description
- Bullets
- Brand
- Product Node ID (Target)
- Preprocessed the dataset
- Removed stop words
- Extracted keywords
- Combined columns and prepared the final dataset
- Applied Count Vectorizer
- Calculated TF-IDF
- Fed the Data to ML Algorithms
August 1, 2020