oyly16's Stars
syp2ysy/VRP-SAM
[CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"
facebookresearch/projectaria_tools
projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
wl-zhao/VPD
[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.
JerryX1110/awesome-rvos
Referring Video Object Segmentation / Multi-Object Tracking Repo
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
fpv-iplab/EASG
Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)
ashkamath/mdetr
amazon-science/polygon-transformer
yongliu20/UniLSeg
[CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"
henghuiding/Vision-Language-Transformer
[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
mttr2021/MTTR
UX-Decoder/DINOv
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
keflanagan/CliMer
PlusLabNLP/ENVISION
Code for our EMNLP-2023 paper: "Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge"
hkchengrex/Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
zsef123/Connected_components_PyTorch
A PyTorch implementation of Connected Components Labeling
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
SHI-Labs/OneFormer
OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023
wjn922/ReferFormer
[CVPR2022] Official Implementation of ReferFormer
VainF/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
facebookresearch/ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
epic-kitchens/VISOR-HOS
Code for recreating the HoS benchmark of VISOR
Hedlen/awesome-segment-anything
Tracking and collecting papers/projects/others related to Segment Anything.
venom12138/VSCOS
DavidZhangdw/Visual-Tracking-Development
Visual Object Tracking