Top N queries by Latency - aggregator implementation #11295

ansjcy · 2023-11-21T21:29:05Z

Is your feature request related to a problem? Please describe.
Currently, OpenSearch lacks a direct means of providing insights into "top queries" that have a significant impact on latency. We want to build an in-memory aggregator to capture those queries at the coordinator node level.
(Parent RFC: #11186)

Describe the solution you'd like
Implement a priority-queue-based in-memory data store on the coordinator node, with a configurable window size, designed to efficiently store the top N queries by latency. The data model for the stored query attributes should be extensible enough to cover different types of resources and metrics.
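
To make the idea concrete, here is a minimal Python sketch of a priority-queue-based top N store. It is an illustration only, not the implementation that landed in OpenSearch (which is written in Java). All names here (`QueryRecord`, `TopNLatencyStore`, `top_n`, `window_seconds`, `clock`) are hypothetical, and the sketch assumes a simple tumbling window that is cleared once it expires; the actual windowing behavior may differ.

```python
import heapq
import itertools
import time
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List, Tuple


@dataclass
class QueryRecord:
    """Hypothetical record. `attributes` is an open map so that new
    resource types or metrics can be attached without a schema change."""
    timestamp: float
    latency_ms: float
    attributes: Dict[str, Any] = field(default_factory=dict)


class TopNLatencyStore:
    """Keeps the N slowest queries seen in the current window (assumed tumbling)."""

    def __init__(self, top_n: int, window_seconds: float,
                 clock: Callable[[], float] = time.time) -> None:
        if top_n <= 0 or window_seconds <= 0:
            raise ValueError("top_n and window_seconds must be positive")
        self.top_n = top_n
        self.window_seconds = window_seconds
        self._clock = clock
        self._window_start = clock()
        # Min-heap keyed on latency: the root is the fastest of the kept
        # queries, so it is the one to evict when a slower query arrives.
        self._heap: List[Tuple[float, int, QueryRecord]] = []
        # Unique counter so ties on latency never compare QueryRecord objects.
        self._tiebreak = itertools.count()

    def _rotate_if_expired(self) -> None:
        now = self._clock()
        if now - self._window_start >= self.window_seconds:
            self._heap.clear()
            self._window_start = now

    def add(self, latency_ms: float, **attributes: Any) -> None:
        self._rotate_if_expired()
        record = QueryRecord(self._clock(), latency_ms, attributes)
        entry = (latency_ms, next(self._tiebreak), record)
        if len(self._heap) < self.top_n:
            heapq.heappush(self._heap, entry)
        elif latency_ms > self._heap[0][0]:
            # Slower than the current minimum: replace it in O(log N).
            heapq.heapreplace(self._heap, entry)

    def top_queries(self) -> List[QueryRecord]:
        """Return the kept records, slowest first."""
        self._rotate_if_expired()
        return [rec for _, _, rec in sorted(self._heap, reverse=True)]
```

With this shape, each insert costs O(log N) and memory stays bounded at N records per window, regardless of request volume. For example, `store = TopNLatencyStore(top_n=10, window_seconds=60)` followed by `store.add(123.4, source="...", indices=["logs"])` would record one query with two free-form attributes.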

Describe alternatives you've considered
Alternatively, we could export the data and offload the query instrumentation logic to external sinks. However, that would require us to retain all the necessary data for every request. In addition, we currently lack an on-node store capable of holding all request information for aggregation and calculating the top N when a user requests it. We also want to avoid introducing a dependency on a specific sink in this feature.

Additional context
Please see the parent RFC: #11186

ansjcy · 2024-02-06T06:23:56Z

This was completed in #11903.

ansjcy added the enhancement and untriaged labels Nov 21, 2023

ansjcy mentioned this issue Nov 21, 2023

Asynchronous processor and exporter for query insights data #11296

Closed

ansjcy added this to Performance Roadmap Nov 21, 2023

ansjcy moved this to Now (This Quarter) in Performance Roadmap Nov 21, 2023

ansjcy moved this from Now (This Quarter) to In Progress in Performance Roadmap Nov 21, 2023

ansjcy self-assigned this Nov 21, 2023

ansjcy added the Search:Query Insights label Nov 21, 2023

This was referenced Dec 7, 2023

[Draft] Query Insight Plugin with Top Queries feature #11506

Closed

[META] Generic Query Insights Framework #11522

Open

getsaurabh02 moved this from In Progress to In-Review in Performance Roadmap Dec 15, 2023

getsaurabh02 added the v2.12.0 label Jan 22, 2024

ansjcy moved this from In-Review to Now (This Quarter) in Performance Roadmap Jan 23, 2024

ansjcy closed this as completed Feb 6, 2024

github-project-automation bot moved this from Now (This Quarter) to Done in Performance Roadmap Feb 6, 2024
