rogerscristo/rl-teacher

Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback

PythonMIT

Readme
0Issues
0Stargazers
1Watcher

Watchers

jhcloos

Contact site admin: Geeks.