lucidrains/PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

PythonMIT

Readme
46Issues
7.6kStargazers
142Watchers

Stargazers

Prev
Next

Contact site admin: Geeks.