EchoForger's Stars
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
salesforce/HIVE
GaParmar/clean-fid
PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
fatPeter/mini-splatting
RedAIGC/StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
konrad-gajdus/miniMNIST-c
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
abdo-eldesokey/build-a-scene
Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation"
jellyfin/jellyfin
The Free Software Media System
PanchengZhao/LAKE-RED
[CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.
csyxwei/ELITE
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
YBYBZhang/ControlVideo
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
PJLallen/OSFormer
Official Implementation of ECCV2022 paper "OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers"
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
jiaosiyu1999/MAFT
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
jiuntian/interactdiffusion
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
DirtyHarryLYL/HOI-Learning-List
A list of Human-Object Interaction Learning.
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
ssecv/PSR
The official implementation of ACM MM 2023 "Partitioned Saliency Ranking with Dense Pyramid Transformers"
PKUFlyingPig/cs-self-learning
计算机自学指南
freeCodeCamp/freeCodeCamp
freeCodeCamp.org's open-source codebase and curriculum. Learn to code for free.
mli/paper-reading
深度学习经典、新论文逐段精读