OptimalScale/LMFlow

Why is Lisa’s training loss always 0?

orderer0001 opened this issue · 1 comment

Why is Lisa’s training loss always 0?

Thanks for your interest in LMFlow! That's strange. Could you share more details about your training statistics and fine-tuning options so we can help look into the problem?