bigscience-workshop/multilingual-modeling

Inconsistent Evaluation Results

yongzx opened this issue · 0 comments

I am getting different results depending on whether I run training and evaluation together or separately.
Rerunning evaluation after training finishes (by removing --do_train) gives a better result than running training + eval in a single invocation.
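
To make the two modes concrete: assuming the usual Hugging Face `Trainer` flow, the combined run evaluates whatever weights are in memory when `train()` returns, while an eval-only rerun reloads the model from the checkpoint saved on disk. If the last on-disk checkpoint is not the same as the final in-memory state (e.g. `save_steps` did not land on the final step, or `load_best_model_at_end` differs between runs), the two evaluations see different weights. The sketch below is a hypothetical illustration in plain PyTorch, not this repo's actual training code; all names (`TinyRegressor`, `ckpt.pt`) are made up.

```python
import torch
from torch import nn

# Toy stand-in for the trained model (hypothetical, for illustration only).
class TinyRegressor(nn.Module):
    def __init__(self):
        super().__init__()
        self.w = nn.Linear(1, 1)

    def forward(self, x):
        return self.w(x)

torch.manual_seed(0)
model = TinyRegressor()
opt = torch.optim.SGD(model.parameters(), lr=0.1)
x = torch.randn(64, 1)
y = 3 * x

save_every = 10  # analogous to --save_steps
for step in range(1, 26):
    loss = ((model(x) - y) ** 2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
    if step % save_every == 0:
        torch.save(model.state_dict(), "ckpt.pt")  # last save is at step 20

# "train + eval together": evaluates the in-memory weights from step 25.
with torch.no_grad():
    print("together:", ((model(x) - y) ** 2).mean().item())

# "eval separately": a fresh run reloads from disk, i.e. the step-20 weights.
reloaded = TinyRegressor()
reloaded.load_state_dict(torch.load("ckpt.pt"))
with torch.no_grad():
    print("separate:", ((reloaded(x) - y) ** 2).mean().item())
```

The two printed losses differ because different weights get evaluated; in a real run the gap could go in either direction, so it may be worth checking which checkpoint the eval-only rerun actually loads and whether `--save_steps` / `--load_best_model_at_end` match between the two invocations.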