Pinned Repositories
TaciturnMute's Repositories
TaciturnMute/RLHF4Math
This repository contains sources about reinforcement learning human feedback for math reasoning,.
TaciturnMute/FinRL
硕士毕业设计~
TaciturnMute/Psyduck
不要忘了我们的羁绊啊啊啊啊啊!(欢迎可达鸭爱好者来pull requests) :blush:
TaciturnMute/rl-zoo
DRL基础模型实现。适合初学者入门或熟练者复习回顾~