samuelbalt/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
PythonMIT
Watchers
No one’s watching this repository yet.
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
PythonMIT
No one’s watching this repository yet.