Can't archive result similar to the paper when fine tune with Reinforcement Learning
Opened this issue · 0 comments
ralphnlp commented
I run VHRL model more times, and I can't get the result similar to the paper. Generated reponses is so short and repetitive although I set hyperparameter as in the paper. Thanks you!!