pickxiguapi/Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
PythonMIT
Stargazers
- ayrnb
- clawnotfound
- cm090999
- cpd0101Baidu
- Crosser-XDUXidian University
- cybisolatedTianjin, China
- EarthringTsinghua University
- FhujinwuSouth China University of Technology
- hahaguoLixiang
- hilookas
- HuFeiHu
- huoliangyu
- infinfin
- Jasonxu1225The Chinese University of Hong Kong, Shenzhen
- JiangZhaoh
- JohannesAckTokyo
- liovn
- makdoudNPalaiseau, France
- nissymoriThe University of Tokyo
- pickxiguapi
- pskun@IDEA-CCNL
- sjYoondeltarSeoul
- superboySBBeijing Institute of Technology
- Taurids
- TianhongDai@ShadowFiendTeam
- xingruiyuUniversity of Technology Sydney
- zhanjiqing
- zhimin-zSoftware Analysis and Intelligence Lab
- Zhiyu-h
- ziyan-wang98KCL