qwenzo/Udacity-Machine-Learning-Nanodegree-Capstone-Project

Jupyter Notebook

Starbucks Capstone Challenge

Udacity Machine learning Nanodegree

Table of Contents

Installation
Files
Acknowledgements

Installation

This project requires a python envirnoment (+v3.) along with the libraries seaborn, sklearn, pandas, maatlibplot and numpy.

Files

The data used in this project is a simplified data from the Starbucks Reward App.

The data is contained in three files:

portfolio.json - containing offer ids and meta data about each offer (10 rows x 6 columns)
profile.json - demographic data for each customer (17000 rows x 5 columns)
transcript.json - records for transactions, offers received, offers viewed, and offers completed (306534 rows × 4 columns)

data description:

portfolio.json

id (string) - offer id
offer_type (string) - type of offer ie BOGO, discount, informational
difficulty (int) - minimum required spend to complete an offer
reward (int) - reward given for completing an offer
duration (int) - time for offer to be open, in days
channels (list of strings)

profile.json

age (int) - age of the customer
became_member_on (int) - date when customer created an app account
gender (str) - gender of the customer (note some entries contain 'O' for other rather than M or F)
id (str) - customer id
income (float) - customer's income

transcript.json

event (str) - record description (ie transaction, offer received, offer viewed, etc.)
person (str) - customer id
time (int) - time in hours since start of test. The data begins at time t=0
value - (dict of strings) - either an offer id or transaction amount depending on the record

Acknowledgements

Credits to Starbucks for providng the dta for this Nanodegree.