lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Python · MIT
Watchers
- abodacs (OpenCoast)
- almekhlafifahad
- betoesquivel (@facebook)
- bratao (Escavador)
- bwcf99 (FagerstenTech)
- cmcintosh (Wembassy)
- damaru (Bangalore)
- davidmroth
- dxzhang456
- ggilley
- gowithwind (Shenzhen)
- igorcosta (GitHub)
- jiht76
- JohnnyOpcode (Toronto, Ontario, Canada)
- justicelee
- kahkeng
- kevinhuotari
- kokoima
- leozhao (Canada)
- lucidrains (San Francisco)
- mad3310
- mbofb
- michalwols (New York)
- migueljette (rev.com)
- MilesLitteral (Manifold Valley)
- nxtreaming
- pawpro
- pigloo
- relsi (Porto Alegre - RS)
- shantanusharma (Sharma Labs)
- sheshuguang
- ssergio198 (Earth)
- strint (SiliconFlow Inc)
- trappedinspacetime (For Personal Use)
- vgoklani (New York, NY)
- widebluesky (Tencent, Baidu, Advance AI, Ginee X)