wuwukongkong's Stars
huggingface/trl
Train transformer language models with reinforcement learning.
ccfddl/ccf-deadlines
⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet). If you find it useful, please star this project, thanks~
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
google-research/parti
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
nianticlabs/mickey
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
yuvalkirstain/PickScore
cvpr-org/author-kit
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
mindspore-lab/mindone
one for all, Optimal generator with No Exception
yfzhang114/Awesome-Multimodal-Large-Language-Models
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
openpsi-project/ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
RockeyCoss/SPO
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
lyndonzheng/Free3D
[CVPR'24] Consistent Novel View Synthesis without 3D Representation
jiawei-ren/insactor
[NeurIPS 2023] InsActor: Instruction-driven Physics-based Characters
ExplainableML/ReNO
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
sjtuplayer/SaRA
SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation
rom1504/embedding-reader
Efficiently read embeddings as a stream from any filesystem
abacusai/smaug
liujf69/Classic-Generative-Model
Simple code demos of classic AIGC models, plus a compilation of blogs and papers on them.
pinterest/atg-research
jacklishufan/diffusion-kto
The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility
Shentao-YANG/Dense_Reward_T2I
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
cccedric/cpql
This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".
general-preference/general-preference-model
Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.org/abs/2410.02197)
princeton-nlp/unintentional-unalignment
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
ZiyiZhang27/sdpo
Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"
catalpaaa/DeMansia
atelion/DPO-Stable-Diffusion