SiyuanMaCS
Undergraduate student at Peking University. Interested in artificial intelligence, reinforcement learning, and fine-tuning large language models.
Peking University
Pinned Repositories
digirl
Official repo for the paper "DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning".
MLLM-protector
The official repository for the paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance".
foam-template
VisualRoleplay