Pinned Repositories
alphageometry
babu111.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
spike-reward
lm-evaluation-harness
A framework for few-shot evaluation of language models.
gg
一个支持节点与订阅链接的 Linux 命令行代理工具 | A command-line tool for one-click proxy in your research and development without installing v2ray or anything else (only for linux)
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
SPIN
Unofficial implementation of Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Stable-Makeup
Pytorch Implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model"
babu111's Repositories
babu111/spike-reward
babu111/alphageometry
babu111/babu111.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes