/online_retail

The goal of this project is to build an unsupervised machine learning model that predicts customers' next purchase date.

Primary LanguageJupyter NotebookGNU General Public License v2.0GPL-2.0

online_retail

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

The Python libraries necessary to execute the codes in the Jupyter notebook are as follows:

In addition, one will need the Online Retail II UCI dataset available on Kaggle.

The dataset must be downloaded into the same directory that the Jupyter notebook is saved.

Project Motivation

In this project I use the given datasets mentioned in the Installation to give relevant answers to the following questions:

  1. How many online customers are there in the dataset and what is their country of origin?

  2. What are the countries that are most represented in the dataset?

  3. Calculate the revenue that was made in each month and what is the percentage revenue based on countries?

  4. Build a machine learning model to estimate if a given customer will buy something again from the online shop in the next quarter.

File Descriptions

There is only one Jupyter notebook and an HTML version of that assigned to this project whereby all the questions mentioned in the Project Motivation section are answered.

Results

The main results of my analysis in the Jupyter notebook have been communicated here.

Licensing, Authors, Acknowledgements

Big credit goes to Mashlyn for the Online Retail II UCI dataset which is licensed under the CC0: Public Domain. In addition, the repository is distributed under the GNU GPLv3 license.