/awesome-segment-anything

Tracking and collecting papers/projects/others related to Segment Anything.

MIT LicenseMIT



Awesome Segment Anything Awesome

Segment Anything has led to a new breakthrough in the field of Computer Vision (CV), and this repository will continue to track and summarize the research progress of Segment Anything in various fields, including Papers/Projects, etc.

If you find this repository helpful, please consider Stars ⭐ or Sharing ⬆️. Thanks.

News

- 2023.5.4: Add SEEM paper.
- 2023.4.18: Add two nice job Inpainting Anything and SAM-Track.
- 2023.4.12: Add some presentations.
- 2023.4.12: An initial version of recent papers or projects.

Contents

Papers/Projects

Basemodel Papers

Title Presentation Paper page Project page Code base Affiliation Description
CLIP img arXiv Colab Code OpenAI Contrastive Language-Image Pre-Training.
OWL-ViT img ECCV2022 - Code Google A open-vocabulary object detector.
OvSeg img CVPR2023 Project Code META Segment an image into semantic regions according to text descriptions.
Painter img CVPR2023 - Code BAAI A Generalist Painter for In-Context Visual Learning.
Grounding DINO img arXiv Colab &Huggingface Code IDEA A stronger open-set object detector
Segment Anything imgimg arXiv Project page Code Meta A stronger Large model which can be used to generate masks for all objects in an image.
SegGPT img arXiv Project page Code BAAI Segmenting Everything In Context based on Painter.
Segment Everything Everywhere All at Once (SEEM) img arXiv Project Page Code Microsoft Semantic Segmentation with various prompt types.

Derivative Papers

Title Presentation Paper page Project page Code base Affiliation Description
CLIP_Surgery img arXiv Demo Code HKUST This work about SAM based on CLIP's explainability to achieve text to mask without manual points.
Segment Anything Model (SAM) for Digital Pathology img arXiv - - - SAM + Tumor segmentation/Tissue segmentation/Cell nuclei segmentation.
SAMCOD - arXiv - Code - SAM + Camouflaged object detection (COD) task.
Segment Anything Is Not Always Perfect img arXiv - - Samsung This paper analyze and discuss the benefits and limitations of SAM.
Inpaint Anything img1 arXiv - Code USTC & EIT SAM + Inpainting, which is able to remove the object smoothly.
PerSAM img arXiv Project Page Code - Segment Anything with specific concepts
Segment Anything in Medical Images img1 arXiv - Code - A step-by-step tutorial with a small dataset to help you quickly utilize SAM.
Segment Anything in Video Super-resolution img1 arXiv - - - The first step to use SAM for low-level vision.

Derivative Projects

Image Segmentation task

Title Presentation Project page Code base Affiliation Description
Grounded Segment Anything img Colab & Huggingface Code - Combining Grounding DINO and Segment Anything
GroundedSAM Anomaly Detection img - Code - Grounding DINO + SAM to segment any anomaly.
Semantic Segment Anything img - Code Fudan A dense category annotation engine.
Magic Copy img - Code - Magic Copy is a Chrome extension that uses SAM.
Segment Anything with Clip img - Code - SAM + CLIP
SAM-Clip img - Code - SAM + CLIP.
Prompt Segment Anything img - Code - SAM + Zero-shot Instance Segmentation.
RefSAM - - Code - Evaluating the basic performance of SAM on the Referring Image Segmementation task.
SAM-RBox img - Code - An implementation of SAM for generating rotated bounding boxes with MMRotate.
Open Vocabulary Segment Anything img1 - Code - An interesting demo by combining OWL-ViT of Google and SAM.
SegDrawer img1img - Code - Simple static web-based mask drawer, supporting semantic drawing with SAM.
Annotation Anything Pipeline img YoutubeDemo Code - SAM + Labelme + LabelImg + Auto-labeling.
AnyLabel img - Code - GPT + SAM.
Roboflow Annotate roboflow-sam-optimized-faster App Blog Roboflow SAM-assisted labeling for training computer vision models.
SALT img - Code - A tool that adds a basic interface for image labeling and saves the generated masks in COCO format.]
SAM U Specify img - Code - Use SAM and CLIP model to segment unique instances you want.]
SAM web UI img App Code - This is a new web interface for the SAM.

Video Segmentation task

Title Presentation Project page Code base Affiliation Description
MetaSeg img HuggingFace Code - SAM + Video.
SAM-Track Video YoutubeDemo Code Zhejiang University This project, which is based on SAM and DeAOT, focuses on segmenting and tracking objects in videos.

Medical image Segmentation task

Title Presentation Project page Code base Affiliation Description
SAM in Napari Video - Code - Segment anything with Napari integration of SAM.
SAM Medical Imaging img - Code - SAM for Medical Imaging.

Inpainting task

Title Presentation Project page Code base Affiliation Description
SegAnythingPro img - Code - SAM + Inpainting/Replacing.

3D task

Title Presentation Project page Code base Affiliation Description
3D-Box img - Code - SAM is extended to 3D perception by combining it with VoxelNeXt.
Anything 3DNovel View img - Code - SAM + Zero 1-to-3.
Any 3DFace imgimg - Code - SAM + HRN.
Segment Anything 3D img - Code Pointcept Extending Segment Anything to 3D perception by transferring the segmentation information of 2D images to 3D space

Image Generation task

Title Presentation Project page Code base Affiliation Description
Edit Anything img - Code - Edit and Generate Anything in an image.
Image Edit Anything img - Code - Stable Diffusion + SAM.
SAM for Stable Diffusion Webui img - Code - Stable Diffusion + SAM.

Remote Sensing task

Title Presentation Project page Code base Affiliation Description
Earth Observation Tools img Colab Code - SAM + Remote Sensing.

Moving Object Detection task

Title Presentation Project page Code base Affiliation Description
Moving Object Detection img - Code - SAM + Moving Object Detection.

OCR task

Title Presentation Project page Code base Affiliation Description
OCR-SAM img Blog Code - Optical Character Recognition with SAM.

Acknowledgement

Some of the presentations in this repository are borrowed from the original author, and we are very thankful for their contribution.

License

This project is released under the MIT license. Please see the LICENSE file for more information.