rll-research/BPref
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
PythonMIT
Stargazers
- 0ifeng0
- Andrewzh112Tsinghua University
- awesomerickySeoul, Republic of Korea
- caixq1996The University of Tokyo
- cedros23istanbul
- cindy17xn
- curieuxjyRobotics Innovatory at SKKU
- emigmoTsinghua University
- ernie55ernieTaipei, Taiwan
- evdcush
- HongJea-ParkMakinaRocks
- Hsiang-1East China Normal University
- JackZhangY
- jh-jeongKAIST
- junsu-kim97KAIST
- kli-casiaCASIA
- LeejwUniverse
- loofahcus01.ai
- MengHsuxCity University of Hong Kong
- n-shintaro
- nakamotooUC Berkeley
- norabelroseEleutherAI
- pjw1Makinarocks
- ShawnKSThe Chinese University of Hong Kong, Shenzhen
- ShelyH
- SSKK-L
- SSKKaiUniversity of Bristol
- sudo-michaelVancouver, BC
- tombewley@jpmorganchase
- tyleryzhuFremont, CA
- WaterkinSouth China University of Technology
- XizoBFudan University & Northwestern Polytechnical University
- yachenkangwestlake university
- younggyoseo
- yuanzhi0515
- zmtlandl