Pinned Repositories
easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
lm-evaluation-harness
A framework for few-shot evaluation of language models.
anonymous
mandyyyyii.github.io
minicourse
Coding and sharing examples for the SST group minicourses
scibench
Smatify
Smartify app
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
mandyyyyii's Repositories
mandyyyyii/scibench
mandyyyyii/anonymous
mandyyyyii/mandyyyyii.github.io
mandyyyyii/minicourse
Coding and sharing examples for the SST group minicourses
mandyyyyii/Smatify
Smartify app