tnilanon/rl-teacher
Code for Deep RL from Human Preferences [Christiano et al.], plus a webapp for collecting human feedback.
Python · MIT