Can't archive result similar to the paper when fine tune with Reinforcement Learning

Question

Can't archive result similar to the paper when fine tune with Reinforcement Learning

Opened this issue 3 years ago · 0 comments

I run VHRL model more times, and I can't get the result similar to the paper. Generated reponses is so short and repetitive although I set hyperparameter as in the paper. Thanks you!!