zhentingqi/rStar

could you recommend some classical self-play RL papers

Opened this issue · 1 comments

thank you

maybe this repository may be helpful! Awesome-LLM-Strawberry