/RLVF

Reinforcement Learning from Verifier Feedback in Coq

Primary LanguageJupyter Notebook

RLVF

Reinforcement Learning from Verifier Feedback in Coq