AmosYaoMa's Stars
chwilms/SOS
SOS: Segment Object System for Open-World Instance Segmentation With Object Priors (accepted at ECCV 2024)
pasqualedem/LabelAnything
Multi-Class Few-Shot Semantic Segmentation with Visual Prompts
chengzeyi/stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
cvlab-stonybrook/BodyHands
Whose Hands Are These? Hand Detection and Hand-Body Association in the Wild, CVPR 2022
hustvl/Matte-Anything
[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models
ibaiGorordo/ONNX-SAM2-Segment-Anything
Python scripts for the Segment Anythin 2 (SAM2) model in ONNX
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
tianrun-chen/SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
YichiZhang98/SAM4MIS
SAM & SAM 2 for Medical Image Segmentation: Open-Source Project Summary
ZiyuGuo99/SAM2Point
The Most Faithful Implementation of Segment Anything (SAM) in 3D
Kwai-Kolors/Kolors
Kolors Team
sujanshresstha/SAM2-in-video
This repository contains code for deploying a Gradio application using the SAM2 model for video processing. The application allows users to interact with the model through a user-friendly web interface.
WZH0120/SAM2-UNet
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
IDEA-Research/MaskDINO
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
cvlab-kaist/CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
IDEA-Research/OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
MedicineToken/Medical-SAM2
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2
sagieppel/fine-tune-train_segment_anything_2_in_60_lines_of_code
The repository provides code for training/fine tune the Meta Segment Anything Model 2 (SAM 2)
Leiyi-Hu/mona
The official implementation of "Adapter is All You Need for Tuning Visual Tasks".
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Traffic-X/ViT-CoMer
Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
tding1/AdaContour
AdaContour: Adaptive Contour Descriptor with Hierarchical Representation
IMPLabUniPr/mmdetection
Open MMLab Detection Toolbox and Benchmark
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
AssafSinger94/dino-tracker
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”
yoxu515/aot-benchmark
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch