[FEATURE] Avoid throttling when writing data to index #640

ykmr1224 · 2024-09-10T18:38:32Z

Is your feature request related to a problem?
When flint execute indexing query, each worker node will call OpenSearch bulk API to write data.
Currently it could end up causing throttling and execution failure due to too high request rate.
Although already implemented rate limiter (#567) and bulk request retry (#572), it still have high chance to cause failure especially when:

There are many worker node
OpenSearch Serverless is used as index store (does not support WAIT_FOR refresh policy)
Other job or other process send requests to the destination

What solution would you like?
To fully address this issue, we want to introduce adaptive rate limit to bulk request. One example of adaptive control algorithm is AIMD(additive-increase/multiplicative-decrease), which is used for TCP congestion control (ref). By utilizing this kind of algorithm, each node will try to adjust their traffic by gradually increase the call rate and quickly decrease when the request is throttled. (Each node will ideally end up similar call rate)

Netflix/concurrency-limits provides AIMD based concurrency limit and we might be able to utilize it for our implementation.

What alternatives have you considered?

Centralized rate limiter using DynamoDB, Redis, etc.
Driver node to decide rate limit for each worker node

Do you have any additional context?
n/a

The text was updated successfully, but these errors were encountered:

ykmr1224 added enhancement New feature or request untriaged labels Sep 10, 2024

ykmr1224 mentioned this issue Sep 12, 2024

[FEATURE] Dynamically adjust rate limit for OpenSearch requests #570

Closed

dai-chen removed the untriaged label Sep 17, 2024

dai-chen mentioned this issue Sep 17, 2024

[FEATURE] Performance and Scalability Enhancements for Flint Index #365

Open

dai-chen added the DataSource:OpenSearch label Sep 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Avoid throttling when writing data to index #640

[FEATURE] Avoid throttling when writing data to index #640

ykmr1224 commented Sep 10, 2024

[FEATURE] Avoid throttling when writing data to index #640

[FEATURE] Avoid throttling when writing data to index #640

Comments

ykmr1224 commented Sep 10, 2024