nottombrown/rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
PythonMIT
Stargazers
- 0xhirokiBrooklyn, NY
- 8enmannSan Francisco
- ajsalkeldUnited Kingdom
- auxsophiaLas Vegas, Nevada
- awjulianiSan Francisco Bay Area
- bgoTürkiye
- Carmezim@colonynetworks
- charlieok
- curiousilyBulgaria
- edglazer
- edranWildest stealth climate tech company on the planet
- esafakArchipelago AI
- evancaseyBrookyn, NY
- eyadsibai
- Feryal@deepmind
- gdangeloAlterClass
- hiepph@TheRealResourcify
- hosford42@transparentai-tech
- jithinodattuinferencemachines
- jmarbach
- jppgksPredibase
- kylefritzAurora
- Laughing-Boy@fossasia @loklak @udacity
- lionelblondeSwitzerland
- loretoparisi@Musixmatchdev
- mekzaAmazon Web Services
- ngurnani
- nicieja
- nottombrownAnthropic
- odellus@phytomech
- sidbrahma@ibm-research
- stjordanisGreece
- thisrayNational Tsing Hua University
- vlad17
- xfcygaocanBeijing
- xixaiSan Francisco, CA