Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD

MIT License

Awesome Reinforcement Learning from Human Feedback

A collection of resources on Reinforcement Learning from Human Feedback (RLHF), mainly focused on pretrained models.

📜 Papers & Blog

Survey

Pre-LM RLHF

LM RLHF

Repos

Datasets

Videos & Lectures

TODO

  • Add more descriptions

📧 Contact Me

If you have any questions, please feel free to contact me (📧: andy.yangzhen@gmail.com).