Raw data files are stored in the `Raw Data` folder. The data file we used is `Kickstarter.csv`.
The raw data files from `Raw Data` have been cleaned using `clean_data.ipynb`, and the cleaned datasets are saved into the `Clean_data` folder.
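A minimal sketch of the cleaning step, assuming the raw CSV has columns like `state`, `name`, `goal`, and `deadline` (these names and the filters are assumptions; the actual logic lives in `clean_data.ipynb`):

```python
# Hypothetical cleaning sketch; column names and filters are assumptions.
import pandas as pd

df = pd.read_csv("Raw Data/Kickstarter.csv")

# Keep only finished campaigns and drop rows missing key fields.
df = df[df["state"].isin(["successful", "failed"])]
df = df.dropna(subset=["name", "goal", "deadline"])

df.to_csv("Clean_data/Kickstarter_clean.csv", index=False)  # assumed output name
```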
We scraped the extra text features required using these files (a scraping sketch follows the list):
- `Scrap Story.ipynb`
- `Scrape Comments.ipynb`
- `Scrape FAQ.ipynb`
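A sketch of the scraping approach with `requests` and BeautifulSoup, assuming a `urls` column and a `div.story-content` selector (both hypothetical; the notebooks above contain the actual scrapers):

```python
# Hypothetical scraping sketch; the URL column and CSS selector are assumptions.
import pandas as pd
import requests
from bs4 import BeautifulSoup

def scrape_story(url: str) -> str:
    """Fetch a project page and return its story text, or '' on failure."""
    try:
        resp = requests.get(url, timeout=10)
        resp.raise_for_status()
    except requests.RequestException:
        return ""
    soup = BeautifulSoup(resp.text, "html.parser")
    story = soup.select_one("div.story-content")  # assumed selector
    return story.get_text(" ", strip=True) if story else ""

df = pd.read_csv("Clean_data/Kickstarter_clean.csv")  # assumed file name
df["story"] = df["urls"].apply(scrape_story)          # assumed URL column
```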
Scraped text data from the datasets in `Clean_data` will be uploaded to the `Output` folder. The outputs are:
- Comments
- FAQ
- Update
Once all the relevant data had been scraped, we merged everything back into one dataset. This is done with `combined_scraped_data.py`, and the output dataset is saved as `Combined_dataset.csv`.
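A merge sketch, assuming each scraped output is a CSV keyed by a shared `id` column (file names and the join key are assumptions; `combined_scraped_data.py` performs the real merge):

```python
# Hypothetical merge sketch; file names and the join key are assumptions.
import pandas as pd

combined = pd.read_csv("Clean_data/Kickstarter_clean.csv")
for name in ("Comments", "FAQ", "Update"):
    extra = pd.read_csv(f"Output/{name}.csv")
    combined = combined.merge(extra, on="id", how="left")  # assumed 'id' key

combined.to_csv("Combined_dataset.csv", index=False)
```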
We performed EDA on the dataset with `EDA.ipynb`.
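A quick look at the kind of checks involved, assuming a `state` target column (the full analysis is in `EDA.ipynb`):

```python
# Hypothetical EDA sketch; the target column name is an assumption.
import pandas as pd

df = pd.read_csv("Combined_dataset.csv")
print(df.shape)
print(df["state"].value_counts(normalize=True))              # class balance
print(df.isna().mean().sort_values(ascending=False).head())  # missingness
```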
We performed feature preprocessing on the dataset with `Features_Preprocessing.ipynb`.
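A preprocessing sketch combining TF-IDF on the scraped text with scaling on numeric features, assuming `story`, `goal`, and `backers_count` columns (the actual pipeline is in `Features_Preprocessing.ipynb`):

```python
# Hypothetical preprocessing sketch; all column names are assumptions.
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.preprocessing import StandardScaler

df = pd.read_csv("Combined_dataset.csv")

preprocess = ColumnTransformer([
    ("story_tfidf", TfidfVectorizer(max_features=5000), "story"),
    ("numeric", StandardScaler(), ["goal", "backers_count"]),
])

X = preprocess.fit_transform(df)
y = (df["state"] == "successful").astype(int)  # assumed target encoding
```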
Our models are (a training sketch follows the list):
- Logistic Regression (`Logreg.ipynb`)
- Decision Tree (`Decision Tree Final.ipynb`)
- Random Forest (`Random_Forest.ipynb`)
- SVM (`TJ_SVM-update.ipynb`)
- Neural Network (`Neural Network.ipynb`)
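A sketch of how one of these models is trained and evaluated, using Logistic Regression as the example (feature and target columns are assumptions; each notebook above trains its own model in full):

```python
# Hypothetical training sketch; feature and target columns are assumptions.
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

df = pd.read_csv("Combined_dataset.csv")
X = df[["goal", "backers_count"]]              # assumed numeric features
y = (df["state"] == "successful").astype(int)  # assumed target encoding

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("Test accuracy:", model.score(X_test, y_test))
```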
Trained models are saved to output files as `{model}.pckl`.
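How a fitted model would be pickled and reloaded under that naming scheme (the stand-in model and the lowercase file name are assumptions):

```python
# Hypothetical save/load sketch following the {model}.pckl naming.
import pickle
from sklearn.linear_model import LogisticRegression

model = LogisticRegression().fit([[0.0], [1.0]], [0, 1])  # stand-in fitted model

with open("logreg.pckl", "wb") as f:  # assumed file name
    pickle.dump(model, f)

with open("logreg.pckl", "rb") as f:
    restored = pickle.load(f)
```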
We performed stacking on all our models in `Ensemblestacking.ipynb`.
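A stacking sketch with scikit-learn's `StackingClassifier`, mirroring the model list above (the exact base learners, hyperparameters, and meta-learner live in `Ensemblestacking.ipynb`; the meta-learner here is an assumption):

```python
# Hypothetical stacking sketch; hyperparameters and meta-learner are assumptions.
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

stack = StackingClassifier(
    estimators=[
        ("logreg", LogisticRegression(max_iter=1000)),
        ("tree", DecisionTreeClassifier()),
        ("forest", RandomForestClassifier()),
        ("svm", SVC(probability=True)),
    ],
    final_estimator=LogisticRegression(),
    cv=5,
)
# Fit and score as in the training sketch above:
# stack.fit(X_train, y_train); stack.score(X_test, y_test)
```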