This plugin allows you to apply Florence-2 directly to your FiftyOne datasets.
If you haven't already, install FiftyOne:
pip install -U fiftyone transformers accelerate
Then, install the plugin:
fiftyone plugins download https://github.com/jacobmarks/fiftyone_florence2_plugin
Generate captions in three levels of detail
- Detect objects using Florence-2's default classes
- Dense region captioning
- Open vocabulary object detection with text input
- Region proposals
- Detect text in images, with or without bounding boxes
- Ground an input phrase (caption) by detecting the relevant objects in the image. You can either specify the caption directly or use any
fo.StringField
on the dataset as the caption field.
- Segment the image based on the input referring expression. You can either specify the expression directly or use any
fo.StringField
on the dataset as the expression field.
Happy exploring!