/RecurrentRLHF

Custom imitation library for making RecurrentReward model. I used GRU.

Primary LanguagePython

Stargazers