ernie-research/Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
PythonMIT
Stargazers
- 1787648106404
- 2391134843
- alphadlJD Explore Academy, JD.com Inc.
- amadeuzou
- chenxn2020
- ChristophAltBayer
- cordercorderTianjin University
- cyk1337Baidu
- emigmoTsinghua University
- Fiiicus
- flow3rdownZhejiang Univeristy, Tencent
- forrestbingAlibaba Inc
- GanjinZeroDAMO Academy
- GasolSun36DAMO Academy, Alibaba Group
- hongtangshuisjtu & bytedance
- jc-ryanUniversity of Chinese Academy of Sciences
- jinuk0211대구 북구
- josecohenca
- kanseaveg
- kevon217Publicis Sapient
- KpKqwq
- lipijiNUAA
- neneluoEdinburgh
- randxieExplore New Things
- ruleGreenThe Chinese University of Hong Kong
- scottsuk0306@kaistAI
- shyamsn97
- SUDA-HLT-ywfangSoochow University
- TableLLM
- wccccp
- william-ljz
- wonderseen
- wyclike
- zhaochen0110Soochow University
- zhouweixiaoBUAA
- zxlzrearth