lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Python · MIT
Stargazers
- James4Ever0
- ell-hol (Paris, France)
- AakashKumar144
- KaiQiangSong (Bellevue, WA)
- aieveryday
- Zessay (Hangzhou)
- Se-Hun (Seoul, South Korea)
- DevRoss (Guangzhou, China)
- xnliang98 (Beijing, China)
- seshurajup (Hyderabad)
- kgourgou
- conceptofmind
- TheRealAakash
- TianhongDai (United Kingdom)
- shyamsn97
- makdoudN (Palaiseau, France)
- 6Maine
- jrf (Rochester, MN)
- Fazziekey (shanghai)
- smko77 (Korea)
- likecoffee
- giovannizinzi
- nawnoes (Seoul, Korea)
- 0rchard
- tomByrer (Kansas City, MO)
- kugwzk
- BladeSun
- Doohae (Seoul, Korea)
- alec-tschantz
- johnnynunez (Barcelona)
- rodrigobaron (Brazil)
- djwei96 (Hong Kong SAR)
- federico-m-lopez
- HeegyuKim (Seoul, Korea)
- elviswf (Shanghai, China)
- LinxinS97 (LA)