Donghao-Li's Stars
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
haoningwu3639/StoryGen
[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
CrazyBoyM/llama3-Chinese-chat
Chinese-language repository for Llama3 and Llama3.1 (companion to a book in progress... interesting fine-tuned and modified weights from community members and vendors, plus tutorial videos & docs for training, inference, evaluation, and deployment)
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
VQAssessment/DOVER
[ICCV 2023] Official code for the paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official weights and demos provided.
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
zer0int/CLIP-fine-tune
Fine-tuning code for CLIP models
NeoVertex1/SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
XLabs-AI/x-flux-comfyui
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
XLabs-AI/x-flux
hzwer/Awesome-Optical-Flow
A curated list of awesome papers on optical flow and related work.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
tqdm/tqdm
:zap: A Fast, Extensible Progress Bar for Python and CLI
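A minimal sketch of the tqdm API the description refers to: wrapping any iterable in `tqdm(...)` yields its items unchanged while rendering a progress bar (the `desc` label here is an arbitrary example value).

```python
from tqdm import tqdm

# Sum 0..99 while tqdm displays progress; tqdm passes each item through untouched.
total = 0
for i in tqdm(range(100), desc="summing"):
    total += i

print(total)  # the loop result is unaffected by the progress bar
```

The same wrapper works on the command line (`python -c ... | tqdm`) and with any iterator whose length tqdm can infer or that you pass via the `total=` argument.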
pkuliyi2015/multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and Tiled VAE optimizations, licensed under CC BY-NC-SA 4.0
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
bytedance/1d-tokenizer
This repo contains the code for our paper "An Image is Worth 32 Tokens for Reconstruction and Generation"
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
ai-forever/Kandinsky-3
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
dvlab-research/ControlNeXt
Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Kwai-Kolors/Kolors
Kolors Team