new: Added jina clip v1 #408

hh-space-invader · 2024-11-19T11:30:21Z

Adding jinaai/jina-clip-v1
They provided two examples: the first one works, and the second one complains about a missing jinaai/jina-clip-v1/sentence_xlnet_config.json. The output of the first one has small values, as if it were normalized, but that is not mentioned anywhere, so I am not sure.

Update:
The text model needs pooling and normalizing
The image model needs the image to be square
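
To make the update above concrete, here is a minimal sketch of mean pooling followed by L2 normalization. It assumes the text model returns token embeddings of shape (batch, seq_len, dim) together with an attention mask. The function names mean_pool and l2_normalize are illustrative and are not taken from the PR.

```python
import numpy as np


def mean_pool(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Average token embeddings over the sequence axis, ignoring padded positions.

    token_embeddings: (batch, seq_len, dim); attention_mask: (batch, seq_len).
    """
    mask = attention_mask[..., None].astype(token_embeddings.dtype)
    summed = (token_embeddings * mask).sum(axis=1)
    # Guard against division by zero for rows that contain only padding.
    counts = np.clip(mask.sum(axis=1), 1e-9, None)
    return summed / counts


def l2_normalize(vectors: np.ndarray, eps: float = 1e-12) -> np.ndarray:
    """Scale each vector to unit Euclidean length."""
    norms = np.linalg.norm(vectors, axis=-1, keepdims=True)
    return vectors / np.maximum(norms, eps)


# Hypothetical input: one sequence with three tokens, the last one is padding.
tokens = np.array([[[1.0, 0.0], [3.0, 4.0], [9.0, 9.0]]])
mask = np.array([[1, 1, 0]])

pooled = mean_pool(tokens, mask)    # the padded token is excluded from the average
embedding = l2_normalize(pooled)    # unit-length vector
```

Unit-length outputs would also explain the small values observed in the first example.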

All Submissions:

Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

Does your submission pass the existing tests?
Have you added tests for your feature?
Have you installed pre-commit with pip3 install pre-commit and set up hooks with pre-commit install?

New models submission:

Have you added an explanation of why it's important to include this model?
Have you added tests for the new model? Were canonical values for tests computed via the original model?
Have you added the code snippet for how canonical values were computed?
Have you successfully run tests with your changes locally?

joein · 2024-12-03T20:49:07Z

fastembed/image/transform/functional.py

+    height, width = image.height, image.width
+
+    # if the size is larger than the new canvas
+    if width > size or height > size:


Shouldn't it be if width >= size and height >= size?

It should be or. Assume size=500, width=600, height=400. With or, the branch triggers because width > size; with and, it does not trigger because height <= size (and in theory we want to crop whenever either dimension is larger than size).

what will happen to the second dimension?

If height and width are 600 and 400 and the required size is 500, the resulting shape will be (500, 500).
So what are those 100 pixels that appear in the width? What are their values? Which color was used to fill them?

It turns out that, internally, the crop function pads the smaller side with zeros by default, so fill_color is not used if one of the sides is larger than size. I modified the implementation.
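
The geometry discussed in this thread can be sketched without touching any image data. The helpers below are hypothetical and are not the code merged in this PR. They compute a centered crop box that never extends past the image, so the crop itself never needs padding, plus the offset at which the cropped image would be pasted onto a size x size canvas created with the fill color.

```python
from typing import Optional, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom)


def square_crop_box(width: int, height: int, size: int) -> Box:
    """Centered crop box, clamped so that neither side exceeds `size`."""
    crop_w = min(width, size)
    crop_h = min(height, size)
    left = (width - crop_w) // 2
    top = (height - crop_h) // 2
    return (left, top, left + crop_w, top + crop_h)


def plan_pad2square(width: int, height: int, size: int) -> Tuple[Optional[Box], Tuple[int, int]]:
    """Return (crop box or None, paste offset on the square canvas)."""
    box = None
    # `or`, not `and`: crop as soon as either dimension is too large.
    if width > size or height > size:
        box = square_crop_box(width, height, size)
        width, height = box[2] - box[0], box[3] - box[1]
    # Center whatever remains; the uncovered area keeps the canvas fill color.
    offset = ((size - width) // 2, (size - height) // 2)
    return box, offset


# The example from the thread: size=500, width=600, height=400.
box, offset = plan_pad2square(600, 400, 500)
print(box, offset)  # (50, 0, 550, 400) (0, 50)
```

With these numbers the width is cropped to 500 and the remaining 100 rows of height are split evenly above and below the image, so they take the fill color instead of zeros.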

joein · 2024-12-03T20:49:40Z

fastembed/image/transform/functional.py

+        image = image.crop((left, top, right, bottom))
+        return image
+
+    new_image = Image.new(mode="RGB", size=(size, size), color=fill_color)


What if we pass a grayscale image?

In our post processor, the first operation is to change the image to RGB, so it shouldn't happen

joein · 2024-12-03T20:53:22Z

fastembed/image/transform/operators.py

+    @staticmethod
+    def _interpolation_resolver(resample: Optional[str] = None) -> Image.Resampling:
+        interpolation_map = {
+            "nearest": Image.Resampling.NEAREST,
+            "lanczos": Image.Resampling.LANCZOS,
+            "bilinear": Image.Resampling.BILINEAR,
+            "bicubic": Image.Resampling.BICUBIC,
+            "box": Image.Resampling.BOX,
+            "hamming": Image.Resampling.HAMMING,
+        }
+
+        if resample and (method := interpolation_map.get(resample.lower())):
+            return method
+
+        raise ValueError(f"Unknown interpolation method: {resample}")


feels like it should not be a part of Compose class

I felt the same. Got any suggestions? fastembed/common/utils?

Not sure; we could just move it out of the class.
At the very least, calling Compose._interpolation_resolver is super ugly. If we keep this method here, we need to make get_resize a class method rather than a static one; it is only used inside the Compose class anyway.

I've changed get_resize to a classmethod, as _interpolation_resolver would only be used in there.
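
A minimal sketch of the arrangement agreed on here, under stated assumptions: Resampling is a stand-in IntEnum so the example runs without Pillow, and this get_resize signature is hypothetical rather than the one in fastembed. The sketch also compares the lookup result with `is not None`; if the real Image.Resampling.NEAREST has integer value 0, as I believe it does in current Pillow releases, a plain truthiness check like the one in the quoted diff would reject "nearest".

```python
from enum import IntEnum
from typing import Optional


class Resampling(IntEnum):
    """Stand-in for PIL.Image.Resampling (assumption: NEAREST is 0 there too)."""
    NEAREST = 0
    BICUBIC = 3


class Compose:
    @staticmethod
    def _interpolation_resolver(resample: Optional[str] = None) -> Resampling:
        interpolation_map = {
            "nearest": Resampling.NEAREST,
            "bicubic": Resampling.BICUBIC,
        }
        # `is not None` keeps falsy-but-valid members such as NEAREST (0).
        if resample and (method := interpolation_map.get(resample.lower())) is not None:
            return method
        raise ValueError(f"Unknown interpolation method: {resample}")

    @classmethod
    def get_resize(cls, resample: Optional[str]) -> Resampling:
        # A classmethod reaches the helper through `cls`, so the class
        # name does not need to be hard-coded inside its own body.
        return cls._interpolation_resolver(resample)


print(Compose.get_resize("nearest").name)  # NEAREST
```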

joein

please look at the comments above

hh-space-invader requested review from I8dNLo and joein November 19, 2024 11:30

hh-space-invader force-pushed the support-jina-clip branch from 86287ef to 82e2d4b Compare November 21, 2024 00:06

hh-space-invader changed the title ~~WIP: Added jina clip text embedding~~ new: Added jina clip text embedding Nov 21, 2024

joein requested changes Nov 21, 2024

hh-space-invader force-pushed the support-jina-clip branch from aa6be34 to 2de91d5 Compare November 25, 2024 00:11

hh-space-invader requested review from joein and generall November 25, 2024 00:12

I8dNLo approved these changes Nov 26, 2024

joein requested changes Dec 3, 2024

joein changed the title ~~new: Added jina clip text embedding~~ new: Added jina clip v1 Dec 4, 2024

hh-space-invader requested a review from joein December 6, 2024 11:24

joein requested changes Dec 6, 2024

hh-space-invader and others added 17 commits December 10, 2024 04:31

WIP: Added jina clip text embedding

8dbc562

WIP: Added preprocess for jina clip

c1ff4b4

WIP: Added jina clip vision (not sure if it works yet)

464f2f4

improve: Improved mean pooling if the output doesnt have seq length

0c6bf70

fix: Fixed jina clip text

ef673a2

nit

98fc92e

fix: Fixed jina clip image preprocessor

7684805

fix: Fix type hints

34ae2e0

new: added resize2square

tests: Add jina clip vision test case

e7e0986

nit

cf4ac9e

refactor: Update fastembed/image/transform/operators.py

eb7b425

Co-authored-by: George <[email protected]>

fix: Fix indentation

e8a15b9

refactor: Refactored how we call padding for image

2d2e708

fix: Fix pad to image when resized size larger than new square canvas

b22abcc

refactor: minor refactor

0f9cbf6

refactor: Refactor some functions in preprocess image

377836c

fix: Fix to pad image with specified fill color

67d3ef7

hh-space-invader added 2 commits December 10, 2024 04:32

refactor: Change resize to classmethod

63e294b

fix: Fix jina clip text v1

7627d78

hh-space-invader force-pushed the support-jina-clip branch from 74a381e to 7627d78 Compare December 10, 2024 02:33

fix: fix pad to square for some rectangular images (#421)

3a847f6

joein approved these changes Dec 14, 2024

hh-space-invader merged commit 3b5e4c8 into main Dec 16, 2024
17 checks passed

hh-space-invader deleted the support-jina-clip branch December 16, 2024 10:45

I8dNLo mentioned this pull request Dec 24, 2024

V0.5.0 #430

Merged
