lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM.
Python · MIT
Stargazers
- aashiqmuhamed (Carnegie Mellon University)
- AlekseyKorshuk (@Coframe)
- AlvL1225
- codycollier (Texas)
- DaehanKim
- Enescigdem
- feifeibear (Tencent)
- felixdittrich92 (T2K)
- Flowerfan
- fly51fly (PRIS)
- JacobFV (@Limboid @ComputaCo)
- karuniaperjuangan
- Leezekun (Santa Barbara, CA)
- Locchuong96 (@iteam1)
- lxuechen (Stanford University)
- Manchery (Tsinghua University)
- monatis (@qdrant)
- naem1023 (Wrtn Technologies)
- nateraw (@huggingface)
- ngthanhtin (N.G.U)
- noowad93 (Scatterlab, Pingpong)
- olliestanley (United Kingdom)
- poipiii (RSAF RAiD)
- PWhiddy (Seattle WA)
- quqixun (Chengdu, China)
- ShawonAshraf (ellamind GmbH)
- slyviacassell
- taidnguyen (University of Pennsylvania)
- TearGosling
- techthiyanes (Bengaluru)
- tmabraham
- ToheartZhang (Renmin University of China)
- TreezzZ (Pennsylvania State University)
- Yuan-ManX (Shanghai, China)
- ZiruiOu
- ZIZUN (JBNU-CCLab)