Each business's mission is to satisfy their customers. With an e-commerce platform where customers' purchase is affected by the product's feedback and sellers' ratings, keeping a positive feedback loop is crucial for customer return and sales growth. Therefore, in this project, we're interested in predicting the customer review scores, to better understand the customers' experience through product quality, sellers' reputation, and shipping process. This would help the company to continue selecting better products to offer on their platform, recommend more reliable sellers, improve the delivery operation, and especially, design the effective customer services and marketing campaigns to different targeted customers.
In this project, we use data shared by Olist - a Brazilian e-commerce platfrom. The data set contains 9 sub-datasets that offer valuable information on 3 main aspects of an e-commerce transaction:
Products
: category, description, size & weight of the product.Customers and sellers
: tracking id & shipping location.Order and shipping process
: number of items, tracking timestamp of the purchase & shipping, customer reviews (by score and text).
In addition, Brazilian demographic data at state level is used for customer's social-economic context.
You can find the complete notebooks in this repository or click the links below for a quick view:
- Olist_Data_Prep.ipynb : The pre-processing steps to clean the data.
- Olist_EDA_Feature_Engineering.ipynb : The exploratory analysis and feature engineering.
- Olist_Modeling.ipynb : The modeling steps and result interpretation.
- A Tableau Dashboard on Olist Customer Ratings.
This is an extended version of a group project at the University Paris Dauphine. I'd like to thank my groupmate Zoé Hance whom I worked with intensively throughout the project.
Raw data of Olist and detailed descriptions of the attributes were retrieved from this post on Kaggle: Brazilian E-Commerce Public Dataset by Olist .
Raw data of Brazil social-economic factors were loaded from Brazilian Institute of Geography and Statistics:
household average income and population by state.