[BUG] Reroute FetchData failures - on Replica shards in AsyncBatch mode #13705
Comments
Based on the additional logs from the impacted cluster, it is evident that a ShardId is present across multiple batches. The following shards are part of two batches: [.opensearch-sap-log-types-config][0] Batches BatchID hFARfI8BSntRPPWZovof
Hence when Shard
#13710 - fixed here
Describe the bug
When the AsyncBatch operation is enabled, the following stack trace is observed when a node joins the cluster.
The error is not consistently reproducible and occurs at random. The cluster eventually recovers to green because the reroute operation is retried on failure.
Related component
Cluster Manager
To Reproduce
cluster.allocator.existing_shards_allocator.batch_enabled: true
cluster.allocator.gateway.batch_size: 10
Expected behavior
The reroute operation should not fail, and each replica shard should be assigned to only one fetch batch.
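The invariant described here (one fetch batch per shard) can be illustrated with a small check. This is only a sketch and not OpenSearch code: the function name `find_shards_in_multiple_batches`, the dictionary layout, and the batch IDs `batch-a` and `batch-b` are hypothetical, chosen for illustration; only the shard name comes from the logs quoted in the comments.

```python
from collections import defaultdict


def find_shards_in_multiple_batches(batches):
    """Return {shard_id: sorted batch ids} for every shard found in more than one batch.

    `batches` maps a batch id to the list of shard ids it contains.
    This layout is an assumption made for the example.
    """
    owners = defaultdict(set)
    for batch_id, shard_ids in batches.items():
        for shard_id in shard_ids:
            owners[shard_id].add(batch_id)
    # Keep only the shards that violate the "one batch per shard" expectation.
    return {shard: sorted(ids) for shard, ids in owners.items() if len(ids) > 1}


# Hypothetical batch contents; the batch ids are placeholders, not real ones.
batches = {
    "batch-a": ["[.opensearch-sap-log-types-config][0]", "[index-1][0]"],
    "batch-b": ["[.opensearch-sap-log-types-config][0]"],
}

duplicates = find_shards_in_multiple_batches(batches)
for shard, batch_ids in duplicates.items():
    print(f"{shard} is present in batches: {', '.join(batch_ids)}")
```

An empty result would mean every shard belongs to exactly one batch, which is the state the expected behavior calls for.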