/rlhf

This is the repository for a Master's thesis project on Reinforcement Learning from Human Feedback (RLHF).

Primary language: Python
