OTUS_ADV_HW1: A Jupyter Notebook repository from oort77

OTUS Machine Learning Advanced

Goals:

AutoML - try out automatic feature generation/selection and modelling:

Compare AutoML performance in ATOM library (provides TPOT wrapper)
https://tvdboom.github.io/ATOM/about/
with baseline and two out of the box models.
Compare AutoML performance in AutoML mljar-supervised library
https://github.com/mljar/mljar-supervised
with two out of the box models. In addition, will try ensembling of autoML models.

Means:

AutoML tasks will be given to ATOM and mljar-supervised respectively. All preprocessing and pipelines management will be done in ATOM.

Dataset:

Choice of models:

Random Forest and CatBoost classifiers will compete with AutoML solution. LogisticRegression is added as a baseline in ATOM case.

Methodology:

OOB models' hyperparameters will be tuned with BO primarily to get some CV statistics and to level up the competition ground.
Weighted F1 score will be used as the main performance metrics following suggestion of the
competition organizers. Other metrics are collected where possible.

Colab notebooks:

ATOM autoML

mljar-supervised AutoML

oort77/OTUS_ADV_HW1