[BUG] Exclude_ip to remove search nodes not working properly #15347

sivatarunp · 2024-08-22T06:22:00Z

Describe the bug

We have an OpenSearch 2.13.0 cluster which use searchable snapshots. We observed an issue where when we wanted to exclude few search nodes using cluster.routing.allocation.exclude._ip setting, the shards are stuck in relocation stage .

The cluster also seemed to have issues wrt ism polices not being triggered, and any restore operations hanging. Once the setting was removed, things came back to normal. Is it expected behaviour wrt search nodes? If so what is the ideal way to scale up and scale down the search nodes ?

Related component

Search:Searchable Snapshots

To Reproduce

boot an OS cluster 2.13.0 version. Have around 40 search nodes
index some data. Take snapshot and restore on search nodes.
ensure you have enough data and shards > 400 per node
exclude 10 search nodes

Expected behavior

Nodes should have been excluded and shards should have been relocated, without any issues with ISM/other cluster activities

Additional Details

Plugins
Please list all plugins currently enabled.

Screenshots
If applicable, add screenshots to help explain your problem.

Host/Environment (please complete the following information):

OS: [e.g. iOS]
Version [e.g. 22]

Additional context
Add any other context about the problem here.

The text was updated successfully, but these errors were encountered:

mch2 · 2024-09-04T16:09:12Z

Thanks for reporting @sivatarunp, we will try and reproduce on our end. If you could can you please provide the output to /_cat/shards here and /_cat/recovery?active_only=true when the relocation is stuck? Also how many shards per index are you configuring? Thanks.

sivatarunp added bug Something isn't working untriaged labels Aug 22, 2024

github-actions bot added the Search:Searchable Snapshots label Aug 22, 2024

github-project-automation bot added this to Search Project Board Aug 22, 2024

github-project-automation bot moved this to 🆕 New in Search Project Board Aug 22, 2024

sandeshkr419 added the ShardManagement:Routing label Aug 28, 2024

github-project-automation bot added this to Shard Management Project Board Aug 28, 2024

github-project-automation bot moved this to 🆕 New in Shard Management Project Board Aug 28, 2024

mch2 removed the untriaged label Sep 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Exclude_ip to remove search nodes not working properly #15347

[BUG] Exclude_ip to remove search nodes not working properly #15347

sivatarunp commented Aug 22, 2024

mch2 commented Sep 4, 2024

[BUG] Exclude_ip to remove search nodes not working properly #15347

[BUG] Exclude_ip to remove search nodes not working properly #15347

Comments

sivatarunp commented Aug 22, 2024

Describe the bug

Related component

To Reproduce

Expected behavior

Additional Details

mch2 commented Sep 4, 2024