zhangyunming

Shenzhen

zhangyunming's Stars

hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python 17.4k 158 2761.6k
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Language: Python 10.7k 160 177962
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Language: Python 8.4k 54 4201.2k
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection
Language: Python 7.2k 38 195511
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.1k 316 251818
lllyasviel/stable-diffusion-webui-forge
Language: Python 5k 58 451465
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language: Python 4.1k 59 157515
lllyasviel/IC-Light
More relighting!
Language: Python 3.7k 40 56238
layerdiffusion/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
Language: Python 3.6k 36 87320
PKU-YuanGroup/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language: Python 2.6k 27 155190
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language: Python 2.4k 41 0150
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language: Python 2.2k 29 68142
google-deepmind/gemma
Open weights LLM from Google DeepMind.
Language: Jupyter Notebook 2.2k 34 24262
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
Language: Python 1.8k 21 81104
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
Language: Jupyter Notebook 1.4k 70 782
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language: Python 1.3k 22 7667
GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Language: Python 1.3k 17 48132
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Language: Python 1.1k 33 8657
sczhou/Upscale-A-Video
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
842 86 929
lxtGH/OMG-Seg
[CVPR-2024] One Model For Image/Video/Interactive/Open-Vocabulary Segmentation
Language: Python 806 18 838
rlawjdghek/StableVITON
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
Language: Python 779 46 0118
LLaVA-VL/LLaVA-NeXT
Language: Python 702 17 3933
foivospar/Arc2Face
Arc2Face: A Foundation Model of Human Faces
Language: Python 468 15 1831
csslc/CCSR
Official codes of CCSR: Improving the Stability of Diffusion Models for Content Consistent Super-Resolution
Language: Python 360 9 2529
cswry/SeeSR
[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Language: Python 310 7 5014
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
Language: TeX 21710
Kartik-3004/facexformer
Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis
Language: Python 158 8 1316
icandle/CAMixerSR
CAMixerSR: Only Details Need More “Attention” (CVPR 2024)
Language: Python 148 4 2110
THUDM/CogCoM
Language: Python 134 10 199
LIAGM/DAEFR
[ICLR 2024] DAEFR: Dual Associated Encoder for Face Restoration
Language: Python 21 3 20
