The data analysis baseline library.
- "Mr Sanchez, are you a data scientist?"
- "I dabl, Mr president."
Find more information on the website.
Right now, this library is still a prototype. The API might change, and you shouldn't rely on it in any critical settings.
pip install dabl
This library is very much still under development. The current code focuses mostly on exploratory visualization and preprocessing. There are also drop-in replacements for GridSearchCV and RandomizedSearchCV that use successive halving. There are preliminary portfolios in the style of POSH auto-sklearn to find strong models quickly. In essence, that boils down to a quick search over different gradient boosting models and other tree ensembles, and potentially kernel methods.
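For readers unfamiliar with successive halving, here is a minimal pure-Python sketch of the general idea. It is not dabl's implementation. The function name `successive_halving`, the `evaluate` callback, and the `min_budget` and `factor` parameters are all hypothetical names chosen for illustration. The assumption is that every candidate can be scored cheaply on a small budget (for example, a subset of the training samples), and that only the best fraction is worth re-scoring on a larger budget.

```python
def successive_halving(candidates, evaluate, min_budget=10, factor=3):
    """Return the best candidate found by successive halving.

    candidates: non-empty list of hyperparameter settings (any objects).
    evaluate:   hypothetical callback, evaluate(candidate, budget) -> score,
                where higher is better and budget is e.g. a sample count.
    min_budget: resources given to each candidate in the first round.
    factor:     keep the top 1/factor candidates each round and multiply
                the budget by factor.
    """
    if not candidates:
        raise ValueError("candidates must not be empty")

    survivors = list(candidates)
    budget = min_budget
    while len(survivors) > 1:
        # Score every surviving candidate on the current budget, best first.
        ranked = sorted(survivors, key=lambda c: evaluate(c, budget), reverse=True)
        # Keep the top 1/factor of the candidates (always at least one).
        keep = max(1, len(ranked) // factor)
        survivors = ranked[:keep]
        # Survivors get more resources in the next round.
        budget *= factor
    return survivors[0]


# Toy usage: the "model" is a single number and the score peaks at 0.3.
# A real evaluate() would fit a model on `budget` samples and return a
# validation score; this stand-in ignores the budget entirely.
def toy_score(candidate, budget):
    return -(candidate - 0.3) ** 2


learning_rates = [0.001, 0.003, 0.01, 0.03, 0.1, 0.3, 1.0, 3.0, 10.0]
best = successive_halving(learning_rates, toy_score)
print(best)
```

With nine candidates and `factor=3`, the sketch runs two rounds: nine candidates at the smallest budget, then three at triple the budget. The saving over an exhaustive grid search comes from most candidates only ever being scored at the small budget.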
Stay tuned!