[FEATURE] Allow more content type for neural query with multimodal #474

martin-gaievski · 2023-10-25T21:47:55Z

Is your feature request related to a problem?

Currently neural-search supports text and image fields for generation of embeddings in both ingestion and search. Content can be of other types like audio or video information, and that is not supported today, e.g. for search there are only query_text and query_image fields.

What solution would you like?

Ability to pass content like audio or video for data ingestion and search.

What alternatives have you considered?

We can use other solutions to generate embeddings for audio or video content, and then post process results from OpenSearch and other systems.

Do you have any additional context?

It's a good extension for #318

The text was updated successfully, but these errors were encountered:

Sanjana679 · 2023-11-13T10:11:38Z

For videos, does it make sense to extract all the frames in a video and then generate embeddings for each frame? Likewise, for audio, would it make sense to make a transcription of the audio and then generate embeddings on the transcript?

I imagine there are issues with these approaches, but these were my first thoughts and I was wondering if anyone had suggestions for something better.

heemin32 · 2023-11-13T16:49:24Z

For videos, embeddings for frame makes sense. For audio, transcription will lose some information like intonation or volume of the audio.

martin-gaievski added Features Introduces a new unit of functionality that satisfies a requirement untriaged enhancement labels Oct 25, 2023

navneet1v added this to Vector Search RoadMap Oct 30, 2023

github-project-automation bot moved this to Backlog in Vector Search RoadMap Oct 30, 2023

vamshin removed the untriaged label Oct 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Allow more content type for neural query with multimodal #474

[FEATURE] Allow more content type for neural query with multimodal #474

martin-gaievski commented Oct 25, 2023

Sanjana679 commented Nov 13, 2023

heemin32 commented Nov 13, 2023

[FEATURE] Allow more content type for neural query with multimodal #474

[FEATURE] Allow more content type for neural query with multimodal #474

Comments

martin-gaievski commented Oct 25, 2023

Is your feature request related to a problem?

What solution would you like?

What alternatives have you considered?

Do you have any additional context?

Sanjana679 commented Nov 13, 2023

heemin32 commented Nov 13, 2023