mandyyyyii

PhD Student from UCLA

Pinned Repositories

easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Language:Python110 3 1211
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python7.5k 39 1.2k2k
anonymous
Language:Python00
mandyyyyii.github.io
Language:HTML00
minicourse
coding and sharing examples for the SST group minicourses
00
scibench
Language:Python109 2 75
Smatify
smartify app
Language:Python0 2 00
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.7k 30 391360

mandyyyyii's Repositories

mandyyyyii/scibench
Language:Python109 2 75
mandyyyyii/anonymous
Language:Python00
mandyyyyii/mandyyyyii.github.io
Language:HTML00
mandyyyyii/minicourse
coding and sharing examples for the SST group minicourses
00
mandyyyyii/Smatify
smartify app
Language:Python0 2 00