Immocat's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
black-forest-labs/flux
Official inference repo for FLUX.1 models
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
rusty1s/pytorch_cluster
PyTorch Extension Library of Optimized Graph Cluster Algorithms
tim-learn/awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
warmshao/FasterLivePortrait
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
madebyollin/taesd
Tiny AutoEncoder for Stable Diffusion
microsoft/StyleSwin
[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation
SHI-Labs/Smooth-Diffusion
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024
ChenyangLEI/deep-video-prior
[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior
snap-research/MoCoGAN-HD
[ICLR 2021 Spotlight] A Good Image Generator Is What You Need for High-Resolution Video Synthesis
MitchellX/deepfake-models
List some popular DeepFake models e.g. DeepFake, FaceSwap-MarekKowal, IPGAN, FaceShifter, FaceSwap-Nirkin, FSGAN, SimSwap, CihaNet, etc.
andrerochow/fsrt
Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features"
chenxx89/BFRffusion
Official codes of Towards Real-World Blind Face Restoration with Generative Diffusion Prior
EternalEvan/FlowIE
This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"
mkshing/DiffFit-pytorch
Implementation of "DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning"
bernakabadayi/ganavatar
[3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar
YangNaruto/latent-energy-transport
aiiu-lab/MeDM
Official Pytorch Implementation of "MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance"in AAAI 2024.