huangjc
- Put
train_labels.csv
andtrain.csv
in folderinput/predict-student-performance-from-game-play
. About 5GB. - Run
python preprocess.py
- Run the training script
train_mlp.py
,train_catboost.py
andtrain_xgboost.py
- After training, you will see models saved in folder
models/
- When you have all the models in
models
, runevaluate.py
to make parameter iteration and compute F1 scores (on the full training set) submit.py
is a reference for submission to Kaggle. Note this is a local script, please upload a dataset and correct the model paths for a real online submission.
feature_engineer.py
is some magic.
See https://www.kaggle.com/code/vadimkamaev/catboost-new.
See also https://www.kaggle.com/code/gusthema/student-performance-w-tensorflow-decision-forests.