/30d-ML-2021

Primary LanguageJupyter Notebook

Kaggle 30d-ML-2021

Level 1: Base tree models of LGB, XGB, CatB. Tried versions with and without pseudo labels, where several different predictions from other Kagglers are used as the pseudo labels. Batch of similar models are also built with reduced features based on MI scores.

Level 2: Stacking on level 1 with metamodels of LGB and RF, input including both source data and base models results with pseudo labels. MLP is abandoned due to time constraint. Also built standalone strong LGB models. Only LGB no pseudo models stacking is tried, since no pseudo performance is less desirable and cannot share the Kfold with models utilising pseudo labels.

Level 3: Stacking on level 2 metamodels those with pseudo labels predictions by EN and Ridge. Finally, blending the two level 3 results with the level 2 no pseudo stacking model with the weights optimized by Nelder-Mead method.