/MasterResearchAutoML

This AutoML benchmark follows the standardized benchmark suite from OpenML.

Primary language: Jupyter Notebook

Automated Machine Learning: A Study of Top Algorithms Based on Benchmark Datasets

  • H2O AutoML
  • TPOT
  • Auto-sklearn
  • Hyperopt-Sklearn (classification only, no regression)

Related papers

Datasets

  • Classification
  • Regression

Script for selected datasets

# Selected classification datasets
sel_clss_dtst = ['adult', 'agaricus-lepiota', 'churn', 'nursery', 'satimage', 'texture']
# Selected regression datasets
sel_rgrs_dtst = ['294_satellite_image', '218_house_8L', '227_cpu_small', '503_wind', '344_mv', '215_2dplanes']

Loop over datasets

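# Assumption: fetch_data comes from the PMLB package (pip install pmlb),
# whose dataset names match the ones selected above.
from pmlb import fetch_data

# Report the shape of each regression dataset.
for regrs_dtst in sel_rgrs_dtst:
    df = fetch_data(regrs_dtst)
    print("Dataset name:", regrs_dtst, ", dataset size:", df.shape)

# Report the number of missing values in each classification dataset.
for class_dtst in sel_clss_dtst:
    df = fetch_data(class_dtst)
    print("Dataset name:", class_dtst, ", number of missing values:", df.isnull().sum().sum())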

AUCPR

AUCPR (area under the precision-recall curve) reference: http://pages.cs.wisc.edu/~jdavis/davisgoadrichcamera2.pdf
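As a rough illustration of what the AUCPR metric measures, here is a minimal pure-Python sketch of the step-wise area under the precision-recall curve (often called average precision). The function name `average_precision` and the toy labels and scores are hypothetical and do not come from this repository. The step-wise sum is used instead of linear interpolation between points, since the paper linked above argues that linear interpolation is not appropriate in precision-recall space.

```python
def average_precision(y_true, y_score):
    """Step-wise area under the precision-recall curve.

    y_true  : iterable of 0/1 labels (1 = positive class)
    y_score : iterable of scores, higher means "more likely positive"
    Hypothetical helper for illustration only.
    """
    y_true = list(y_true)
    y_score = list(y_score)
    total_pos = sum(y_true)
    if total_pos == 0:
        raise ValueError("y_true must contain at least one positive label")

    # Rank examples from the highest score to the lowest.
    ranked = sorted(zip(y_score, y_true), key=lambda pair: pair[0], reverse=True)

    tp = 0              # true positives above the current threshold
    fp = 0              # false positives above the current threshold
    prev_recall = 0.0
    area = 0.0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Consume every example tied at this score before evaluating it.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        precision = tp / (tp + fp)
        recall = tp / total_pos
        # Add the rectangle between the previous and the current recall.
        area += (recall - prev_recall) * precision
        prev_recall = recall
    return area


# Toy example: two negatives and two positives.
print(round(average_precision([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]), 4))  # → 0.8333
```

In practice a library routine such as scikit-learn's `average_precision_score` would normally be preferred; the sketch only makes the computation explicit.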
