YRlin-12

YRlin-12's Stars

msracver/Deformable-ConvNets
Deformable Convolutional Networks
Language:Python4k957
princeton-vl/RAFT
Language:Python3.3k634
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Language:Python1.8k193
ttt-matching-based-vos/ttt_matching_vos
Authors official PyTorch implementation of the "Test-time Training for Matching-based Video Object Segmentation" [NeurIPS 2023]
Language:Python9
hustvl/WeakSAM
[ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition
Language:Python432
WarlockWendell/AggDet
official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation
Language:Python102
vidit09/domaingen
CLIP the Gap CVPR 2023
Language:Python697
zhengli97/PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
Language:Python2353
ylingfeng/FGVP
Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023
Language:Python352
jianghaojun/Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
39126
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Language:Python3.3k190
FoundationVision/UniRef
[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
Language:Python23515
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Language:Jupyter Notebook5.9k597
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.5k1k
rioxwang/BUPTGraduateThesis
Language:TeX397113
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.2k2.2k
mkshing/ziplora-pytorch
Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
Language:Python51737
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Language:Jupyter Notebook8.1k858
ProjectNUWA/DragNUWA
73168
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.6k2.7k
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language:Python2.6k207
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
Language:Python1.3k71
rinongal/textual_inversion
Language:Jupyter Notebook2.9k280
thu-ml/prolificdreamer
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
Language:Python1.5k44
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language:Python4k366
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA，你的个性化图像动画生成器，利用文本提示将图像变为奇妙的动画
Language:Python91575
showlab/VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
35313
thepowerfuldeez/null-text-inversion
Unofficial implementation of paper NULL-text Inversion for Editing Real Images using Guided Diffusion Models ( https://arxiv.org/abs/2211.09794 )
Language:Jupyter Notebook764
kevinzakka/spatial-transformer-network
A Tensorflow implementation of Spatial Transformer Networks.
Language:Python989268
LukasBommes/mv-extractor
Extract frames and motion vectors from H.264 and MPEG-4 encoded video.
Language:C29960