YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Python
Stargazers
- andreazanette
- andyz245
- ariafyyAria.ai
- asmith26
- benglard
- closedLoopAstrocyte Research
- cray0000Decision Mapper (@dmapper)
- csholder
- dreasysnailApple
- forrestbingAlibaba Inc
- gaojingshengiCAT
- ggrizzly
- gonghaozhang
- gurusuraSura Systems Private Limited
- hackiey
- hi-abhi
- iwangjianPolyU
- jetnewNational University of Singapore
- Jiayi-PanUC Berkeley
- josecohenca
- kevinliang888Princeton
- Kunlun-ZhuMila-Quebec AI Institute; UdeM
- lamisgosu11VNUHCM-UIT
- makkzone
- Minami-su
- NatureGeorgePKU
- shixw1991
- SihanXUUniversity of Michigan, Ann Arbor
- snowkcon
- szrlee
- tokarev-i-v
- wx-bRIOS
- xnliang98Beijing, China
- yanbinwei
- Yannlecun
- YifeiZhou02