A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
Apache License 2.0Apache-2.0