/Crop-CLIP

Crop using CLIP

Primary LanguageJupyter NotebookMIT LicenseMIT

Crop-CLIP

Search subjects/objects in an image using simple text description and get cropped results.

Try it on Colab:

Highlights

"Search Query - Crop!"

"Whats the time"

Time

"Hoodie guy"

Hoodie

"Mini Cooper"

minicooper

"Whiskey Bottle"

Whiskey

How?

  • This is done by combining Object detection yolov5 and OpenAI's CLIP model.
  • Detects and crops objects (yolov5s)
  • Encode cropped images using CLIP
  • Encode search query using CLIP
  • Find the best match

Why?

  • #vacation ☺️

Can also be used to create datasets with some changes in code. In the below example images of Jack daniels bottle has been croped and saved.

Search Query on batch - "Jack Daniels"

dataset

Simple App

πŸ‘‰ Hugging Face Spaces πŸ‘ˆ

app

Limitations

  • Depends heavily on object detection(yolov5).
  • YOLOv5 πŸš€ is a family of object detection architectures and models pretrained on the COCO dataset, So detection depends on COCO classes.

Acknowledgements