shu-le

Zhejiang University

shu-le's Stars

BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12.3k 271 115782
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python11k 128 226803
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language:Python10.4k 104 1461.1k
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
Language:Jupyter Notebook9.4k 103 161753
facebookresearch/sapiens
High-resolution models for human tasks.
Language:Python4.3k 43 135229
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
Language:Python1.7k 27 80115
THUDM/SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
Language:Python979 32 7993
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python931 12 2740
Vchitect/SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Language:Python900 25 3064
KovenYu/WonderJourney
Language:Python667 48 938
foivospar/Arc2Face
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces
Language:Python581 17 2941
liuff19/ReconX
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
491 57 916
ali-vilab/FlashFace
Language:Python365 13 1534
ID-Animator/ID-Animator
Language:Python354 22 1726
guanjz20/StyleSync
Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
Language:Python299 32 1321
aim-uofa/MovieDreamer
249 21 37
zhenzhiwang/HumanVid
Official implementation of HumanVid, NeurIPS D&B Track 2024
Language:Python231 30 152
jeanne-wang/svd_keyframe_interpolation
Language:Python20711
RafailFridman/SceneScape
Official Pytorch Implementation for "SceneScape: Text-Driven Consistent Scene Generation"
Language:Python138 7 18
kyegomez/Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
Language:Python137 5 613
ZCMax/LLaVA-3D
A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
Language:Python1344
vaew/SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
96 5 25
chen-wl20/DreamCinema
DreamCinema: Cinematic Transfer with Free Camera and 3D Character
85 10 21
QQ-MM/Video-CCAM
A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.
Language:Python54 4 72
robincourant/DIRECTOR
Language:Python49 3 24
WUyinwei-hah/IFAdapter
Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".
490
eckertzhang/HumanRef
Language:Python42 1 03
tobran/StoryImager
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
35 5 32
baojudezeze/Generative-Virtual-Try-On
Generative virtual try on (VTON), try-on images of characters can be generated by text prompt.
Language:Jupyter Notebook19 3 03
kunyao2015/StyleLipSync
[ICCV 2023] Official pytorch implementation of "StyleLipSync: Style-based Personalized Lip-sync Video Generation".
Language:Python4 0 02

shu-le

shu-le's Stars

BradyFU/Awesome-Multimodal-Large-Language-Models

instantX-research/InstantID

magic-research/magic-animate

TencentARC/PhotoMaker

facebookresearch/sapiens

NUS-HPC-AI-Lab/VideoSys

THUDM/SwissArmyTransformer

showlab/Show-o

Vchitect/SEINE

KovenYu/WonderJourney

foivospar/Arc2Face

liuff19/ReconX

ali-vilab/FlashFace

ID-Animator/ID-Animator

guanjz20/StyleSync

aim-uofa/MovieDreamer

zhenzhiwang/HumanVid

jeanne-wang/svd_keyframe_interpolation

RafailFridman/SceneScape

kyegomez/Vit-RGTS

ZCMax/LLaVA-3D

vaew/SkyScript-100M

chen-wl20/DreamCinema

QQ-MM/Video-CCAM

robincourant/DIRECTOR

WUyinwei-hah/IFAdapter

eckertzhang/HumanRef

tobran/StoryImager

baojudezeze/Generative-Virtual-Try-On

kunyao2015/StyleLipSync