Primary language: Python

halu_clf

| Dataset            | Accuracy | F1       |
|--------------------|----------|----------|
| TruthfulQA (Train) | 0.999662 | 0.999616 |
| TruthfulQA (Test)  | 0.914160 | 0.902383 |
| MMLU (Train)       | 0.997061 | 0.994098 |
| MMLU (Test)        | 0.672436 | 0.212097 |
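
The evaluation code behind these numbers is not shown here. As a rough sketch of how accuracy and F1 are commonly computed for a binary classifier, and of why the two can diverge as they do on MMLU (Test), the snippet below assumes labels of 0/1 with 1 as the positive class. The function name `accuracy_and_f1` and the toy labels are hypothetical and are not taken from this repository.

```python
def accuracy_and_f1(y_true, y_pred, positive=1):
    """Return (accuracy, F1) for binary labels.

    Assumption: `positive` marks the class that F1 is computed for.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("inputs must be non-empty")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)

    accuracy = correct / len(pairs)
    # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when there are no
    # true positives, false positives, or false negatives at all.
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


# Toy labels, made up for illustration only (not the repository's data).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
acc, f1 = accuracy_and_f1(y_true, y_pred)
print(f"accuracy={acc:.3f} f1={f1:.3f}")
```

In this toy case most negatives are classified correctly, which keeps accuracy moderate, while most positives are missed, which pulls F1 down. A similar pattern could explain a test accuracy well above the test F1, though that would need to be checked against the actual predictions.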