/nutsh

A Platform for Visual Learning from Human Feedback

Primary LanguageTypeScriptApache License 2.0Apache-2.0

Code Style: Google

Demo · Doc · Discussion

nutsh_teaser_v3.mp4

nutsh is a platform designed for visual learning through human feedback. With a user-friendly interface and API, it supports a range of visual modalities, diverse human input methods, and learning mechanisms based on human feedback.

The project is currently in its early stages and under active development. Your feedback is highly appreciated.

Main features

  • ✅ Intuitive Interface
    • ✅ Modern frontend design
    • ⬜ Easy custom model integration (coming)
  • ✅ Multiple Visual Modalities
    • ✅ Images
    • ✅ Videos
    • ⬜ ... (coming)
  • ✅ Diverse Annotations
    • ✅ Masks
    • ✅ Polygons and polylines
    • ✅ Bounding boxes
  • ✅ Versatile Scenarios
    • ✅ Object tracking
    • ✅ Object parts linking
  • ✅ Model-assisted Segmentation
    • ✅ Integration with Segment Anything Model
    • ✅ Positive and negative point prompts
    • ✅ Focus on specific local areas
  • ✅ Learning from Human Feedback
    • ✅ Model fine-tuning based on user feedback

Install

eval "$(curl -sSL nutsh.ai/install)"

Refer to the quick start documentation for a more comprehensive guide.

Quick Overview

welcome.mp4

Diverse Annotations and Versatile Scenarios

Drawing Polygons

In addition to standard polygon operations, Bézier curves are supported, enabling precise annotations of curved objects, such as tires.

polygon_intro.mp4

Drawing masks

Create masks with pixel-level accuracy, ensuring a perfect fit with no gaps or overlaps.

mask_intro.mp4

Tracking Objects

Especially useful when annotating videos, the platform offers an effortless way to track objects across frames.

track_objects.mp4

Linking Object Parts

Objects obscured by others, resulting in fragmented representation, can easily be linked to form a cohesive annotation

link_parts.mp4

Convenient Shortcuts

The platform incorporates various shortcuts, streamlining the annotation process for enhanced speed and accuracy.

Cloning Slice

For polygons sharing a common edge segment, one can be cloned for the other, ensuring a seamless fit without any gaps or overlaps.

slice_clone.mp4

Interpolation

Given two annotations of the same type on both a start and an end frame, heuristic interpolation can automatically generate all intermediate annotations.

polygon_interpolation.mp4

Model-Assisted Segmentation

We offer an API that seamlessly integrates the human labeling interface with deep learning models. Our server facilitates model inference and tuning based on user input. Calculations can be executed on both CPU and GPU platforms.

For instance, our SAM module taps into the Segment Anything Model from Meta AI to enhance segmentation speed and efficiency. While the SAM models are openly accessible to the public, their labeling interface remains proprietary. We also offer advanced features, such as local prediction, to ensure top-notch segmentation results. For a detailed guide, please refer to our documentation.

Global Smart Segmentation

Leverage deep learning models to perform segmentation tasks across entire images, aiding the segmentation process.

global_smart_segmentation.mp4

Local Smart Segmentation

Direct your attention to specific regions of an image and request the model to generate segmentation predictions for that particular section. Such localized predictions often yield more detailed segmentations.

local_smart_segmentation.mp4

Learning from Human Feedback

In addition to utilizing prompts for refining predictions, users can make subsequent adjustments to these predictions. By gathering these modifications, you can train a model that's fine-tuned to your specific needs. Our SAM module comes equipped with features that assist in fine-tuning the SAM decoder seamlessly.

finetune_sam.mp4

Next Steps

Consult the documentation for further information, including:

and more!

Citation

If you find the platform useful for your research projects, please cite

@misc{nutsh,
  title = {nutsh: A Platform for Visual Learning from Human Feedback},
  author = {Xu Han and Fisher Yu},
  howpublished = {\url{https://github.com/SysCV/nutsh}},
  year = {2023}
}