opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
Apache-2.0
Stargazers
- aieveryday
- alexunderch
- Chenfeng1271P.h.D student@University of Adelaide
- flrngel@Ainbr
- fuqianya
- Gangsss@bhsn-ai
- hany606Daejeon, South Korea
- hsh6449GIST AI Graduate school
- LegendBCHuazhong Uni. of Sci. and Tec.
- lixl-st
- MancheryTsinghua University
- MengWoods
- momozzingKonan Tech
- nikitavoloboevMadrid
- p-raj
- PaParaZz1@opendilab
- Poet-LiBai
- qgzangUSTC
- rentainheIDEA
- RIP4KOBEThe Chinese University of Hong Kong
- rodrigodelazcano@Farama-Foundation
- ruoyuGaoAWS
- ShaoZhang0115Shanghai Jiao Tong University
- simonjisuSeoul National University
- sjYoondeltarSeoul
- slyviacassell
- Thirteentj
- tuofeilunhifiLi Auto
- VaninaY
- vasgaoweiAlibaba Group
- Walter0807CFCS, Peking University
- wondervictorHuazhong University of Science and Technology
- Xiang-PanNational University of Singapore
- yukyungleeKorea University DSBA Lab.
- ZeavanLiBeijing, China
- zichuan-liuMSc at Nanjing University | Formerly Interned at Microsoft Reaseach Asia and Alibaba DAMO Academy