lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
PythonMIT
Watchers
- akirti
- matarof
- rishikksh20New Delhi, India
- ion-stormroot@localhost
- kennyk3Charlotte, NC
- ArunkumarRamananBangalore, India
- didiniao
- zarandioon
- mobarmgSaudi Arabia
- duyvuleoAustralia
- xiaoke912
- dariorl
- e-motionlabs
- oldlee11beijing
- vchiley
- UrielChParis
- diiviousSaxapahaw, NC, USA
- Athomield
- virtualrobotixItaly
- shicz86
- prakashs12
- pxzero
- hdchao
- rorosan
- jrrexliang01Tianjin
- gabrielclimaRio de Janeiro
- androm3daAustin, TX USA
- Swall0wTokyo
- beelzebub2006shanghai
- sanshanxiashi
- nsl2014fm
- gouxuteChina
- BenDerPan
- wuchtw
- radovankavickyBratislava, Slovakia
- ruapotato