Make output metrics extendable #199
Comments
This is going to require flexibility in how the results metrics are defined, computed, processed, and reported. It will take some consideration.
Removes recall calculation from benchmarking logic as this is delayed until opensearch-project/opensearch-benchmark#199 can be implemented. Signed-off-by: John Mazanec <[email protected]>
Right, I guess there are a few other applications I can think of that may require similar functionality: Anomaly Detection, Learning to Rank. For these, recall/accuracy are KPIs.
+1 on this. Extendable metrics would help Anomaly Detection as well; we are starting to define how we benchmark AD in various ways, such as our own execution time to get an anomaly result, recall/precision, and other KPIs on our own specific workloads and while a detector is running. I also want to add that this will greatly benefit ML-Commons as well.
@IanHoang as discussed offline, taking a look at this issue |
Added a new issue, #435, which would allow the user to specify the percentiles they want to see; that would be a subset of this issue.
Is your feature request related to a problem? Please describe.
For the k-NN plugin, I am working on adding a custom runner that will execute queries from a numeric data set and calculate the recall. The k-NN plugin offers an assortment of approximate nearest neighbor algorithms. Generally, users need to trade off the accuracy of their system (recall) against latency/throughput, so they need to see both kinds of metrics when benchmarking.
In the custom query runner, I return the recall alongside the latency, but this only gets stored as request meta-data, not as a reported result (see the sketch below).
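For context, here is a minimal sketch of what such a runner could look like, assuming the OpenSearch Benchmark custom-runner convention that a runner returns a dict whose standard keys ("weight", "unit") are understood by the framework and whose extra keys are kept only as request meta-data. The function name, params keys, and registered runner name are illustrative assumptions, not the actual k-NN workload code:

```python
# A minimal sketch, not the actual k-NN workload implementation. Names such as
# knn_search_with_recall and the params keys ("index", "body", "neighbors") are
# illustrative assumptions.
import time


async def knn_search_with_recall(opensearch, params):
    """Run a k-NN query and compute recall against ground-truth neighbors."""
    index = params["index"]
    body = params["body"]                       # k-NN query body (including size/k)
    true_neighbors = set(params["neighbors"])   # ground-truth ids for this query

    start = time.perf_counter()
    response = await opensearch.search(index=index, body=body)
    took_s = time.perf_counter() - start

    returned_ids = {hit["_id"] for hit in response["hits"]["hits"]}
    recall = len(returned_ids & true_neighbors) / max(len(true_neighbors), 1)

    # "weight" and "unit" are standard result keys; extra keys such as "recall"
    # are currently only kept as request meta-data, which is the limitation
    # described in this issue.
    return {
        "weight": 1,
        "unit": "ops",
        "recall": recall,
        "took_s": took_s,
    }


def register(registry):
    # Registration hook used by workload plugins; the runner name is illustrative.
    registry.register_runner("knn-search-with-recall", knn_search_with_recall, async_runner=True)
```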
Describe the solution you'd like
I would like to be able to specify that "recall" should be reported as a metric in the results and to define its aggregation (for example, the mean).
More generally, I would like to be able to define custom metrics for runners, specify how they are aggregated, and have them show up in the results; a rough illustration of the intent follows below.
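As a rough illustration (plain Python, not an existing opensearch-benchmark API), the sketch below shows the kind of behavior being requested: per-request values such as recall, captured today only as meta-data, reduced by a user-chosen aggregation into a single value in the final results. The metric_defs shape and the sample records are hypothetical:

```python
# Plain-Python illustration of the requested behavior; metric_defs and the
# sample records are hypothetical, not part of opensearch-benchmark.
import statistics

# Per-request values as they might appear in request meta-data today.
request_records = [
    {"recall": 0.92, "took_s": 0.004},
    {"recall": 0.88, "took_s": 0.005},
    {"recall": 0.95, "took_s": 0.003},
]

# What this issue asks for: declare a custom metric plus its aggregation and
# have it reported in the results alongside latency/throughput.
metric_defs = {
    "recall": statistics.mean,  # aggregate recall as the mean, as requested above
    "took_s": max,              # e.g. worst-case latency
}

results = {
    name: agg([record[name] for record in request_records if name in record])
    for name, agg in metric_defs.items()
}
print(results)  # {'recall': 0.9166..., 'took_s': 0.005}
```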
Describe alternatives you've considered
Additional context