Crop-CLIP

Search for subjects/objects in an image using a simple text description and get cropped results.

Try it on Colab:

Highlights

"Search Query - Crop!"

"Whats the time"

Time

"Hoodie guy"

Hoodie

"Mini Cooper"

minicooper

"Whiskey Bottle"

Whiskey

How?

  • This is done by combining YOLOv5 object detection with OpenAI's CLIP model.
  • Detect and crop objects (yolov5s)
  • Encode the cropped images using CLIP
  • Encode the search query using CLIP
  • Find the best match between the query and the crops
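
The matching step above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it assumes the crops and the query have already been encoded into embedding vectors (for example by CLIP), and it assumes cosine similarity as the scoring function. The names `cosine_similarity` and `best_match` and the toy vectors are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_match(
    query_embedding: Sequence[float],
    crop_embeddings: List[Sequence[float]],
) -> Tuple[int, float]:
    """Return (index, score) of the crop embedding most similar to the query."""
    if not crop_embeddings:
        raise ValueError("no crops to search; the detector found no objects")
    scores = [cosine_similarity(query_embedding, c) for c in crop_embeddings]
    best_index = max(range(len(scores)), key=scores.__getitem__)
    return best_index, scores[best_index]


# Toy stand-ins for real embeddings (assumption: real ones would come from CLIP).
query = [1.0, 0.0, 0.0]
crops = [
    [0.0, 1.0, 0.0],  # unrelated crop
    [0.9, 0.1, 0.0],  # close to the query
    [0.5, 0.5, 0.0],  # partially related crop
]
index, score = best_match(query, crops)
print(f"best crop: {index} (similarity {score:.3f})")
```

In a real pipeline the index returned here would select which detected crop to show or save.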

Why?

  • #vacation ☺️

With some changes to the code, this can also be used to create datasets. In the example below, images of Jack Daniel's bottles have been cropped and saved.

Search Query on batch - "Jack Daniels"

dataset
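
One possible shape for such a change, offered only as a sketch: instead of keeping the single best crop per image, keep every crop whose similarity to the query clears a cutoff. The function name `select_matches`, the use of cosine similarity, and the `threshold` value are all assumptions for illustration, not taken from the repository; the embeddings are toy stand-ins for CLIP outputs.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_matches(
    query_embedding: Sequence[float],
    crop_embeddings: List[Sequence[float]],
    threshold: float,
) -> List[int]:
    """Return indices of all crops whose similarity to the query is >= threshold."""
    return [
        i
        for i, crop in enumerate(crop_embeddings)
        if cosine_similarity(query_embedding, crop) >= threshold
    ]


# Toy batch: three crop embeddings scored against one query embedding.
query = [1.0, 0.0]
batch = [[1.0, 0.0], [0.8, 0.6], [0.0, 1.0]]
keep = select_matches(query, batch, threshold=0.7)  # threshold is a hypothetical value
print("crops to save:", keep)
```

The returned indices would then decide which cropped images get written to the dataset folder; a suitable threshold would need to be tuned on real data.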

Simple App

app

Limitations

  • Depends heavily on object detection (YOLOv5): an object that is not detected cannot be cropped or matched.
  • YOLOv5 🚀 is a family of object detection architectures and models pretrained on the COCO dataset, so detection is limited to COCO classes.

Acknowledgements