/rl-teacher

Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback

Primary LanguagePythonMIT LicenseMIT

Watchers

No one’s watching this repository yet.