[META] [FEATURE] Add Vector Search Query level metrics to understand latency for different steps in Vector Search Query #1985

navneet1v · 2024-08-19T07:19:15Z

Description

As of 2.16 version of k-NN plugin a vector search query not only does vector search on a native engine index, but does more than that.

Example 1

In Efficient filtering we first run the filters, converts the filters iterators to bitsets and then do either exact search or ANN Search based on certain condition.

Example 2

With new Disk Based Vector Search feature we will be doing 2 phased search first on oversampled k and then rescoring of those top k with full precision vectors.

In both examples we are not just doing vector search but much more and as of today we have no way to know what is the latency of these internal operations. I do agree there is a profile API that gives the breakdown that API is doesn't track the above mentioned granular operations of a query. Another thing is profile api is a point in time latency and most of the time users are interested in query latency stats over time and also its sub operations.

Solution

I can think of below solutions:

Improving the profile api results for vector search query to include these sub operations. We can take some inspiration of how Bool query/disMax query does it.
We should look into QueryInsights plugin and see how we can add these sub operations stats via that plugin rather than emitting them via cluster stats.

I don't think the above 2 solutions will be enough but I see that as a start and may be we need to add integrations at few more places to be really have a good mechanism for query stats.

navneet1v · 2024-11-25T18:10:08Z

@shatejas did an implementation of this feature via profile API as referenced here: #2286

navneet1v · 2024-11-27T08:41:19Z

Another reference for same feature: #2289

navneet1v added untriaged enhancement and removed untriaged labels Aug 19, 2024

github-actions bot added the untriaged label Aug 19, 2024

navneet1v removed the untriaged label Aug 19, 2024

navneet1v added this to Vector Search RoadMap Aug 19, 2024

navneet1v moved this to Backlog (Hot) in Vector Search RoadMap Aug 19, 2024

navneet1v added the search-improvements label Sep 14, 2024

navneet1v mentioned this issue Nov 25, 2024

[FEATURE] Additional components in profile breakdown for KNN query to increase visibility #2286

Open

navneet1v mentioned this issue Nov 27, 2024

Add debug information for knn queries #2289

Open

navneet1v changed the title ~~[FEATURE] Add Vector Search Query level metrics to understand latency for different steps in Vector Search Query~~ [META] [FEATURE] Add Vector Search Query level metrics to understand latency for different steps in Vector Search Query Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[META] [FEATURE] Add Vector Search Query level metrics to understand latency for different steps in Vector Search Query #1985

[META] [FEATURE] Add Vector Search Query level metrics to understand latency for different steps in Vector Search Query #1985

navneet1v commented Aug 19, 2024 •

edited

Loading

navneet1v commented Nov 25, 2024

navneet1v commented Nov 27, 2024

[META] [FEATURE] Add Vector Search Query level metrics to understand latency for different steps in Vector Search Query #1985

[META] [FEATURE] Add Vector Search Query level metrics to understand latency for different steps in Vector Search Query #1985

Comments

navneet1v commented Aug 19, 2024 • edited Loading

Description

Example 1

Example 2

Solution

navneet1v commented Nov 25, 2024

navneet1v commented Nov 27, 2024

navneet1v commented Aug 19, 2024 •

edited

Loading