tomekkorbak/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
Python, MIT
Stargazers
- 51616 (https://vistec.ist/)
- aflah02 (Max Planck Institute for Software Systems: MPI SWS)
- alec-tschantz
- caigaojiang
- CBlagden (California Institute of Technology)
- crazyofapple (Shenzhen)
- Dahoas
- fly51fly (PRIS)
- GeeYangML
- haileyschoelkopf (@EleutherAI)
- hlzhang109 (Cambridge, MA)
- hundredeuk2 (Hanyang Univ. BIS Lab)
- jon-tow (New York, New York)
- kjappelbaum (EPFL)
- kwon13 (Seoul)
- Life-0-1
- meet-cjli
- mindgitrwx
- monopoly-db
- nateraw (@huggingface)
- Nyandwi (Carnegie Mellon)
- omarsar (DAIR.AI)
- qtvhao (FPT Software)
- Qwinpin (Huawei)
- rich-junwang
- rodrigobaron (Brazil)
- roszcz (@Nospoko)
- Se-Hun (HanwHa Life)
- sudahui
- SushantDaga
- thejaminator (SERIMATS)
- TheodoreGalanos (Austrian Institute of Technology)
- tmabraham
- tmgthb (Kyndryl)
- UeFan
- xanderdunn