
Add rate limiter for bulk request #567

Merged · 9 commits · Aug 16, 2024

Conversation

ykmr1224 (Collaborator)

Description

  • Add rate limiter for bulk request
  • Without a rate limit, it would keep sending requests until the destination OpenSearch throttles them (especially when the NONE refresh policy is used).
  • Added config param spark.datasource.flint.write.bulkRequestRateLimitPerNode to specify the rate limit per node.
  • The default is set to 0, which means no rate limit.
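A minimal sketch of the behavior the description implies: a configured rate of 0 disables limiting entirely, and any positive rate spaces requests out evenly. Class and method names here are illustrative assumptions, not the PR's exact code.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.LockSupport;

// Sketch only: a per-node bulk request rate limiter gated on a config value.
// A rate of 0 means "no rate limit", matching the documented default.
public class BulkRequestRateLimiterSketch {
    private final long minIntervalNanos; // 0 means disabled
    private long nextFreeTime = System.nanoTime();

    public BulkRequestRateLimiterSketch(long requestsPerSecond) {
        this.minIntervalNanos =
            requestsPerSecond <= 0 ? 0 : TimeUnit.SECONDS.toNanos(1) / requestsPerSecond;
    }

    public boolean isEnabled() {
        return minIntervalNanos > 0;
    }

    // Blocks until the next request is allowed; no-op when disabled.
    public synchronized void acquirePermit() {
        if (!isEnabled()) {
            return;
        }
        long now = System.nanoTime();
        long wait = nextFreeTime - now;
        nextFreeTime = Math.max(now, nextFreeTime) + minIntervalNanos;
        if (wait > 0) {
            LockSupport.parkNanos(wait);
        }
    }

    public static void main(String[] args) {
        // rate 0 disables limiting, matching the documented default
        BulkRequestRateLimiterSketch disabled = new BulkRequestRateLimiterSketch(0);
        System.out.println(disabled.isEnabled()); // false

        BulkRequestRateLimiterSketch twoPerSec = new BulkRequestRateLimiterSketch(2);
        long start = System.nanoTime();
        for (int i = 0; i < 3; i++) {
            twoPerSec.acquirePermit();
        }
        // 3 permits at 2/sec should take roughly 1 second after the first permit
        System.out.println(TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start) >= 900);
    }
}
```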


By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Tomoyuki Morita <[email protected]>
@ykmr1224 ykmr1224 marked this pull request as ready for review August 15, 2024 16:56

@penghuo penghuo left a comment


During index/MV refreshing, can the user adjust this value if the customer is under pressure?
My concern is that this configuration may be helpful during the testing stage but not in a production environment. We should investigate methods to dynamically control the ingestion rate.

docs/index.md Outdated
@@ -527,6 +527,7 @@ In the index mapping, the `_meta` and `properties`field stores meta and schema i
- `spark.datasource.flint.write.batch_size`: The number of documents written to Flint in a single batch request. Default value is Integer.MAX_VALUE.
- `spark.datasource.flint.write.batch_bytes`: The approximate amount of data in bytes written to Flint in a single batch request. The actual amount of data written to OpenSearch may be more than this. Default value is 1mb. The writing process checks after each document whether the total number of documents (docCount) has reached batch_size or the buffer size has surpassed batch_bytes. If either condition is met, the current batch is flushed and the document count resets to zero.
- `spark.datasource.flint.write.refresh_policy`: Default value is false. Valid values: [NONE(false), IMMEDIATE(true), WAIT_UNTIL(wait_for)].
- `spark.datasource.flint.write.bulkRequestRateLimitPerNode`: Default value is 0, which disables the rate limit.
Collaborator


Does 0.1 mean 1 request per 10 seconds? Could you add more detail to the doc?

ykmr1224 (Collaborator, Author) · Aug 16, 2024


It won't accept decimal values. We could support them if we modified the rate limit period, but I think it would become complicated considering someone might specify a value like 1.23. If we want to reduce the traffic to less than 1 request/sec, we can reduce the batch size instead, which would reduce the actual amount of data per request.
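The trade-off described here can be made concrete with a little arithmetic: at a fixed integer request rate, shrinking batch_size lowers the effective document throughput, which achieves the same effect as a fractional request rate would. A hedged sketch (the numbers are purely illustrative):

```java
// Sketch of the batch-size trade-off: effective document throughput is
// bounded by requestsPerSecond * batchSize, so halving the batch size at
// 1 req/sec is equivalent to 0.5 req/sec at the original batch size.
public class ThroughputSketch {
    static long maxDocsPerSecond(long requestsPerSecond, long batchSize) {
        return requestsPerSecond * batchSize;
    }

    public static void main(String[] args) {
        // 1 req/sec with 500-doc batches ~ 0.5 req/sec with 1000-doc batches
        System.out.println(maxDocsPerSecond(1, 500));  // 500
        System.out.println(maxDocsPerSecond(1, 1000)); // 1000
    }
}
```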


private BulkRequestRateLimiterHolder() {}

public synchronized static BulkRequestRateLimiter getBulkRequestRateLimiter(
Collaborator


Is the Holder necessary? Could this be moved into BulkRequestRateLimiter?

Collaborator Author


As written in the code comment, the Holder is needed so we can share a single instance while keeping BulkRequestRateLimiter testable. If we made BulkRequestRateLimiter itself a singleton, we could not test it with different parameters.

return execute(OS_WRITE_OP_METRIC_PREFIX, () -> client.bulk(bulkRequest, options));
return execute(OS_WRITE_OP_METRIC_PREFIX, () -> {
try {
rateLimiter.acquirePermit();
Collaborator


  1. Each bulk request contains multiple index requests. If the throttle is on each index request, the bulk request limit may not help.
  2. How does an OpenSearch customer configure this parameter?

Collaborator Author


  1. A rate limit on bulk requests limits the overall number of index requests, since each bulk request carries at most one batch of index requests.
  2. They can add it as an extra Spark parameter for now. We might want to add an attribute to Datasource so we can specify it per datasource.
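The change shown in the earlier diff can be sketched end to end. This is a simplified stand-in: `execute`, the metric prefix, and the bulk call are stubbed (the real code wraps `client.bulk(bulkRequest, options)` and handles `InterruptedException` in a try block); only the wrapping pattern is the point.

```java
import java.util.function.Supplier;

// Sketch of the rate-limited bulk path: acquire a permit before each
// bulk request, inside the metric-wrapping execute() helper.
public class RateLimitedBulkSketch {
    interface RateLimiter {
        void acquirePermit();
    }

    // Stub for Flint's metric-wrapping execute(); here it just runs the op.
    static <T> T execute(String metricPrefix, Supplier<T> op) {
        return op.get();
    }

    static String bulkWithRateLimit(RateLimiter rateLimiter, Supplier<String> bulkCall) {
        return execute("OS_WRITE_OP_METRIC_PREFIX", () -> {
            rateLimiter.acquirePermit(); // blocks until a request is allowed
            return bulkCall.get();       // stands in for client.bulk(bulkRequest, options)
        });
    }

    public static void main(String[] args) {
        RateLimiter noop = () -> {}; // disabled limiter: permits are free
        System.out.println(bulkWithRateLimit(noop, () -> "bulk-response"));
    }
}
```

Because the permit is taken per bulk request, the index requests inside each batch are throttled collectively rather than individually.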

@ykmr1224
Collaborator Author

During index/MV refreshing, can the user adjust this value if the customer is under pressure? My concern is that this configuration may be helpful during the testing stage but not in a production environment. We should investigate methods to dynamically control the ingestion rate.

Let me take this as an item for a long-term solution. If we use a token bucket with an external state store (DynamoDB, etc.), we can adjust the limit dynamically.
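A hedged sketch of that long-term idea: a token bucket whose refill rate can be changed at runtime, as it could be if the rate lived in an external store such as DynamoDB. The in-memory rate field here is a stand-in for the external store; nothing below is the PR's code.

```java
import java.util.concurrent.atomic.AtomicLong;

// Sketch of a dynamically adjustable token bucket. A real implementation
// would read the current rate from an external state store (e.g. DynamoDB)
// so operators can change the limit without restarting the Spark job.
public class DynamicTokenBucketSketch {
    private final AtomicLong refillPerSecond; // stand-in for the external store
    private double tokens;
    private long lastRefillNanos = System.nanoTime();

    public DynamicTokenBucketSketch(long initialRate) {
        this.refillPerSecond = new AtomicLong(initialRate);
        this.tokens = initialRate; // start with a full bucket
    }

    // Operators adjust the limit at runtime; takes effect on the next refill.
    public void setRate(long newRate) {
        refillPerSecond.set(newRate);
    }

    public synchronized boolean tryAcquire() {
        long now = System.nanoTime();
        double elapsedSec = (now - lastRefillNanos) / 1e9;
        long rate = refillPerSecond.get();
        // Refill proportionally to elapsed time, capped at the bucket size.
        tokens = Math.min(rate, tokens + elapsedSec * rate);
        lastRefillNanos = now;
        if (tokens >= 1.0) {
            tokens -= 1.0;
            return true;
        }
        return false;
    }

    public static void main(String[] args) {
        DynamicTokenBucketSketch bucket = new DynamicTokenBucketSketch(2);
        System.out.println(bucket.tryAcquire()); // bucket starts full
        System.out.println(bucket.tryAcquire());
        System.out.println(bucket.tryAcquire()); // both tokens already spent
        bucket.setRate(100); // limit raised dynamically, no restart needed
    }
}
```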

@ykmr1224
Collaborator Author

Added [Experimental] to the doc.

@penghuo penghuo merged commit 15ee355 into opensearch-project:main Aug 16, 2024
6 checks passed
opensearch-trigger-bot bot pushed a commit that referenced this pull request Aug 16, 2024
Signed-off-by: Tomoyuki Morita <[email protected]>
(cherry picked from commit 15ee355)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
penghuo pushed a commit that referenced this pull request Aug 16, 2024
(cherry picked from commit 15ee355)

Signed-off-by: Tomoyuki Morita <[email protected]>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
penghuo pushed a commit that referenced this pull request Aug 17, 2024
(cherry picked from commit 15ee355)

Signed-off-by: Tomoyuki Morita <[email protected]>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>