Cyccyyycyc
Yan Cai is an undergraduate student at School of Cyber Science and Engineering, Wuhan University, China.
Wuhan UniversityWuhan
Pinned Repositories
Cyccyyycyc.github.io
safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Cyccyyycyc's Repositories
Cyccyyycyc/Cyccyyycyc.github.io
Cyccyyycyc/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback