ECNU CS Data Mining, 2021 Spring
✔️ Our work finally ranked 2nd. Not bad.
@Zhao Yunxiang, @Wei Mingda, @Wu Ronghuan
python3 trainer.py
You may need to modify the following line in trainer.py to match your Python environment:
cmd = "/usr/local/bin/python3 evalution.py %s %s" % (gold_file, pred_file)
- coauthor_1 + coauthor_2 + RandomForestClassifier: 75.08%
- coauthor_1 + coauthor_2 + stringDistance_1 + stringDistance_2 + RandomForestClassifier: 93.5442% (see the sketch after this list)
- Actually, the demo provided by my teacher performs really well :-)
- 2021/05/27: After adding some features, the accuracy reached 95.1%.
- 2021/05/31: After model ensemble, the accuracy reached 95.71%.
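A minimal sketch of the feature + classifier setup above with scikit-learn. Only the four feature names come from the list; the toy data, labels, and split are assumptions, not our real pipeline:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Toy feature matrix standing in for the four features mentioned above:
# coauthor_1, coauthor_2, stringDistance_1, stringDistance_2 (values are made up).
rng = np.random.default_rng(0)
X = rng.random((200, 4))
y = (X[:, 0] + X[:, 2] > 1.0).astype(int)  # synthetic labels just for the demo

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)
print("accuracy:", clf.score(X_test, y_test))
```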
- Add feature `publication_year`. Time-consuming.
- Try different ways to calculate text similarity. Jaro-Winkler wins (see the similarity sketch after this list).
- Add feature `keyword` of paper titles and keywords. Time-consuming.
- Add feature `journal_conference_year`. Good.
- Add feature `journal_count`. Slow.
- Try classifier `GradientBoostingRegressor`. Result: 0: 44%, 1: 99%, overall: 50%. Modified the parameters (`n_estimators=100, learning_rate=1.0, max_depth=1, random_state=0`). Bad (see the regressor sketch after this list).
- Add feature `affiliation_count`. Slow. Nothing changed.
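The similarity sketch: a minimal comparison of candidate string-similarity measures, assuming the `jellyfish` library (any Jaro-Winkler / Levenshtein implementation would work the same way); the sample strings are made up:

```python
import jellyfish  # jellyfish >= 0.8 exposes jaro_winkler_similarity

a = "Data Mining for Author Disambiguation"
b = "data mining for author-name disambiguation"

# Compare two candidate measures on the same pair of strings.
jw = jellyfish.jaro_winkler_similarity(a.lower(), b.lower())
lev = jellyfish.levenshtein_distance(a.lower(), b.lower())
print("Jaro-Winkler similarity:", jw)  # in [0, 1], higher = more similar
print("Levenshtein distance:", lev)    # edit count, lower = more similar
```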
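The regressor sketch: one way to use `GradientBoostingRegressor` for a binary task is to threshold its continuous output at 0.5. The parameters come from the item above; the toy data and the threshold are assumptions:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import classification_report

# Toy binary data standing in for the real feature matrix.
rng = np.random.default_rng(0)
X = rng.random((300, 4))
y = (X[:, 0] > 0.5).astype(int)

reg = GradientBoostingRegressor(
    n_estimators=100, learning_rate=1.0, max_depth=1, random_state=0
)
reg.fit(X[:200], y[:200])

# Turn the regression output back into 0/1 labels (the 0.5 threshold is an assumption).
pred = (reg.predict(X[200:]) >= 0.5).astype(int)
print(classification_report(y[200:], pred))
```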
`RandomForestClassifier` performs well. Finally, we merged different models to generate the predictions, assigning each model a weight according to its accuracy.
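A minimal sketch of this kind of accuracy-weighted merging, assuming weighted probability averaging; the base models, toy data, and 0.5 threshold below are assumptions, not our exact setup:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression

# Toy data standing in for the real feature matrix.
rng = np.random.default_rng(0)
X = rng.random((400, 4))
y = (X[:, 0] + X[:, 1] > 1.0).astype(int)
X_train, y_train = X[:200], y[:200]
X_val, y_val = X[200:300], y[200:300]
X_test = X[300:]

models = [
    RandomForestClassifier(n_estimators=100, random_state=0),
    GradientBoostingClassifier(random_state=0),
    LogisticRegression(max_iter=1000),
]

# Weight each model by its validation accuracy, then average predicted probabilities.
weights = []
for m in models:
    m.fit(X_train, y_train)
    weights.append(m.score(X_val, y_val))
weights = np.array(weights) / np.sum(weights)

proba = sum(w * m.predict_proba(X_test)[:, 1] for w, m in zip(weights, models))
final_pred = (proba >= 0.5).astype(int)
print(final_pred[:10])
```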