YRlin-12's Stars
msracver/Deformable-ConvNets
Deformable Convolutional Networks
princeton-vl/RAFT
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
ttt-matching-based-vos/ttt_matching_vos
Authors official PyTorch implementation of the "Test-time Training for Matching-based Video Object Segmentation" [NeurIPS 2023]
hustvl/WeakSAM
[ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition
WarlockWendell/AggDet
official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation
vidit09/domaingen
CLIP the Gap CVPR 2023
zhengli97/PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
ylingfeng/FGVP
Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023
jianghaojun/Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
FoundationVision/UniRef
[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
rioxwang/BUPTGraduateThesis
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
mkshing/ziplora-pytorch
Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
ProjectNUWA/DragNUWA
Stability-AI/generative-models
Generative Models by Stability AI
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
rinongal/textual_inversion
thu-ml/prolificdreamer
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
showlab/VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
thepowerfuldeez/null-text-inversion
Unofficial implementation of paper NULL-text Inversion for Editing Real Images using Guided Diffusion Models ( https://arxiv.org/abs/2211.09794 )
kevinzakka/spatial-transformer-network
A Tensorflow implementation of Spatial Transformer Networks.
LukasBommes/mv-extractor
Extract frames and motion vectors from H.264 and MPEG-4 encoded video.