This the repository for Model Selection algorithm on Tabular Dataset.
Instruction to run the code:
- Run PrioritySearch.ipynb for priority search on synthetic dataset.
- Run Greedy_Search_Forward_Backward.ipynb for Greedy (Forward and Backward) on synthethic dataset.
- Run AdultDataset_GridSearch.ipynb for Grid search on Adult dataset
- Adult Dataset can be downloaded from here: http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data
- Adult Test Dataset can be downloaded from here : http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.test
Notes:
- We recommend running these notebooks on colab
- Use same version as below to prevent version issues numpy = 1.24.3, pandas = 1.5.3, scikit-learn = 1.1.2, statsmodels = 0.14.0, shap = 0.42.1, matplotlib = 3.8.0