babu111

Pinned Repositories

alphageometry
Language:Python0 0 00
babu111.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript00
spike-reward
Language:Python21
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python7.1k 39 1.2k1.9k
gg
一个支持节点与订阅链接的 Linux 命令行代理工具 | A command-line tool for one-click proxy in your research and development without installing v2ray or anything else (only for linux)
Language:Go1.5k 8 73118
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Language:Python2.8k 24 299261
SPIN
Unofficial implementation of Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Language:Python7 2 20
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python31k 251 5.4k4.7k
Stable-Makeup
Pytorch Implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model"
Language:Python131 11 2611

babu111's Repositories

babu111/spike-reward
Language:Python21
babu111/alphageometry
Language:Python0 0 00
babu111/babu111.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript00