shu-le

Zhejiang University

shu-le's Stars

YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Language: Jupyter Notebook 1.7k 96
Shentao-YANG/Dense_Reward_T2I
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
Language: Python 28
InternLM/xtuner
An efficient, flexible, and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language: Python 3.9k 304
CaraJ7/CoMat
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Language: Python 1315
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language: Python 2.1k 86
mihirp1998/AlignProp
AlignProp uses direct reward backpropagation to align large-scale text-to-image diffusion models. Our method is 25x more sample- and compute-efficient than reinforcement learning methods (PPO) for fine-tuning Stable Diffusion.
Language: Python 2358
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 19.8k 2.2k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python 35.1k 4.1k
YBYBZhang/VideoElevator
[arXiv 2024] Official PyTorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models"
Language: Python 1444
yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Language: Python 16414
UMass-Foundation-Model/3D-LLM
Code for 3D-LLM: Injecting the 3D World into Large Language Models
Language: Python 93458
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
1.8k 90
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Language: Python 11.4k 1k
Mowenyii/PAE
[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation
Language: Python 578
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python 9.8k 1.2k
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python 4.6k 401
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python 2.1k 172
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language: Python 4.1k 293
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language: Python 6k 1k
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language: Python 2.7k 176
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language: Python 1.7k 176
evalcrafter/EvalCrafter
[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Language: Jupyter Notebook 1327
lixinustc/KVQ-Challenge-CVPR-NTIRE2024
The first challenge on short-form video quality assessment
Language: Python 59
haoningwu3639/StoryGen
[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
Language: Python 2029
heheyas/V3D
V3D: Video Diffusion Models are Effective 3D Generators
Language: Python 45017
TylerYep/torchinfo
View model summaries in PyTorch!
Language: Python 2.6k 119
ExponentialML/Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
Language: Python 657105
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
Language: Python 2.9k 260
google/latexify_py
A library to generate LaTeX expressions from Python code.
Language: Python 7.2k 383
llava-rlhf/LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
Language: Python 31521
