ChenDRAG's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and Flax.
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's text-to-video model); we hope the open-source community will contribute to it.
huggingface/trl
Train transformer language models with reinforcement learning.
lllyasviel/IC-Light
More relighting!
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel platforms ⚡
openai/consistencydecoder
Consistency Distilled Diff VAE
OpenLLMAI/OpenRLHF
An easy-to-use, scalable, and high-performance RLHF framework (70B+ PPO full tuning, iterative DPO, LoRA, Mixtral).
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
baofff/U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
opendilab/awesome-diffusion-model-in-rl
A curated list of resources on diffusion models in RL (continually updated)
lichao-sun/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
apexrl/Diff4RLSurvey
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
yuvalkirstain/PickScore
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
jannerm/ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
OpenBMB/UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
OpenBMB/Eurus
mihirp1998/AlignProp
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25× more sample- and compute-efficient than reinforcement learning methods (PPO) for fine-tuning Stable Diffusion.
yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
alibaba/VideoMV
VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model
thu-ml/Noise-Contrastive-Alignment
Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
somvy/slic-hf
Experiments with divergence functions for DPO and RLHF
Ghost---Shadow/sequence-likelihood-calibration
Reproduction of SLiC-HF: Sequence Likelihood Calibration with Human Feedback