/hyperparameters-optimization

The accompanying repo for the hyperparameters optimization bdx meetup talk, blog post and webinar

Primary LanguageJupyter NotebookMIT LicenseMIT

hyperparameters-optimization

About

This is the accompanying repo for the Bdx hyperparameters optimization meetup. It also contains the notebook and dataset for this Qucit blog post and also the hyperparameters optimization webinar.

Installation

To install the project dependencies, run:

pip install -r requirements

If you are familiar with Conda, I would suggest creating a virtual environnement and installing the dependencies in the following fashion:

conda create --name hyperparameters-optimization --file requirements.txt

Then activate the environment with the following command:

source activate hyperparameters-optimization

Datasets

IMDB

Raw dataset

In this talk, we analyze an IMDB dataset that you could find here. Notice that the dataset is also available in the data folder (titled "movies_metadata.csv").

Processed dataset

To get the processed dataset, run the following code (don't forget to activate your virtual env):

python scripts/imdb_data_processing.py

Airlines Delay

Raw dataset

In the blog post, we analyze the Airlines Delay dataset that you could find here. Notice that the dataset is also available in the data folder (titled "DelayedFlights.csv.zip").

Results

The live demo results are stored here.

Slides

The talk slides are available here.
The webinar slides are available here.

Notebooks

You can check the different notebooks used during the talk and the live demo by browsing the notebooks folder. It also contains the accompanying notebook for the blog post. The /notebooks/webinar folder contains the webinar notebook.

Resources

Xgboost

Dataset

  • You can find here a complete analysis from the dataset owner.

Hyperopt

License

The MIT License (MIT)