[BUG] [Potential Issue] Cluster state reponse handling thread blocked #12820
Labels
bug
Something isn't working
Cluster Manager
flaky-test
Random test failure that succeeds on second run
Describe the bug
Observed this during gradle-check run for #12813 (comment)
https://build.ci.opensearch.org/job/gradle-check/35542/console
3 generic threads were blocked for processing publication response (it was 3 node cluster in test)
All the threads were probably waiting on below mutex :
OpenSearch/server/src/main/java/org/opensearch/cluster/coordination/Coordinator.java
Lines 1376 to 1392 in f3d2bee
Related component
Cluster Manager
To Reproduce
Expected behavior
Investigate which code path was holding mutex and if it can be optimized (lock duration for code path which was holding it). Right now, it is not clear for how long the threads were blocked.
Additional Details
No response
The text was updated successfully, but these errors were encountered: