zhangyunming's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
lllyasviel/stable-diffusion-webui-forge
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
lllyasviel/IC-Light
More relighting!
layerdiffusion/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
PKU-YuanGroup/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
google-deepmind/gemma
Open weights LLM from Google DeepMind.
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
sczhou/Upscale-A-Video
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
lxtGH/OMG-Seg
[CVPR-2024] One Model For Image/Video/Interactive/Open-Vocabulary Segmentation
rlawjdghek/StableVITON
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
LLaVA-VL/LLaVA-NeXT
foivospar/Arc2Face
Arc2Face: A Foundation Model of Human Faces
csslc/CCSR
Official code for CCSR: Improving the Stability of Diffusion Models for Content Consistent Super-Resolution
cswry/SeeSR
[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
Kartik-3004/facexformer
Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis
icandle/CAMixerSR
CAMixerSR: Only Details Need More “Attention” (CVPR 2024)
THUDM/CogCoM
LIAGM/DAEFR
[ICLR 2024] DAEFR: Dual Associated Encoder for Face Restoration