-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Remote Store] Add extra buffer before deleting older generations of translog #10817
Conversation
Gradle Check (Jenkins) Run Completed with:
|
Compatibility status:Checks if related components are compatible with change c411a25 Incompatible componentsIncompatible components: [https://github.com/opensearch-project/cross-cluster-replication.git] Skipped componentsCompatible componentsCompatible components: [https://github.com/opensearch-project/security.git, https://github.com/opensearch-project/alerting.git, https://github.com/opensearch-project/index-management.git, https://github.com/opensearch-project/anomaly-detection.git, https://github.com/opensearch-project/job-scheduler.git, https://github.com/opensearch-project/sql.git, https://github.com/opensearch-project/asynchronous-search.git, https://github.com/opensearch-project/common-utils.git, https://github.com/opensearch-project/observability.git, https://github.com/opensearch-project/k-nn.git, https://github.com/opensearch-project/reporting.git, https://github.com/opensearch-project/security-analytics.git, https://github.com/opensearch-project/custom-codecs.git, https://github.com/opensearch-project/performance-analyzer.git, https://github.com/opensearch-project/opensearch-oci-object-storage.git, https://github.com/opensearch-project/ml-commons.git, https://github.com/opensearch-project/performance-analyzer-rca.git, https://github.com/opensearch-project/geospatial.git, https://github.com/opensearch-project/notifications.git, https://github.com/opensearch-project/neural-search.git] |
dd548fb
to
8d45cd2
Compare
Gradle Check (Jenkins) Run Completed with:
|
Codecov Report
@@ Coverage Diff @@
## main #10817 +/- ##
============================================
- Coverage 71.31% 71.27% -0.05%
- Complexity 58671 58706 +35
============================================
Files 4860 4869 +9
Lines 276335 276450 +115
Branches 40198 40198
============================================
- Hits 197068 197032 -36
- Misses 62803 62991 +188
+ Partials 16464 16427 -37
|
Gradle Check (Jenkins) Run Completed with:
|
Gradle Check (Jenkins) Run Completed with:
|
Lets create an issue or tag an existing one around using an approach where we can prevent deletion of translog from remote only for the peer recovery case. |
server/src/main/java/org/opensearch/index/translog/RemoteFsTranslog.java
Outdated
Show resolved
Hide resolved
Gradle Check (Jenkins) Run Completed with:
|
Gradle Check (Jenkins) Run Completed with:
|
server/src/test/java/org/opensearch/index/translog/RemoteFsTranslogTests.java
Outdated
Show resolved
Hide resolved
server/src/test/java/org/opensearch/index/translog/RemoteFsTranslogTests.java
Show resolved
Hide resolved
server/src/test/java/org/opensearch/index/translog/RemoteFsTranslogTests.java
Show resolved
Hide resolved
Gradle Check (Jenkins) Run Completed with:
|
Gradle Check (Jenkins) Run Completed with:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
Gradle Check (Jenkins) Run Completed with:
|
Gradle Check (Jenkins) Run Completed with:
|
… translog files Signed-off-by: Gaurav Bafna <[email protected]>
Signed-off-by: Gaurav Bafna <[email protected]>
Signed-off-by: Gaurav Bafna <[email protected]>
@gbbafna @sachinpkale I have rebased and forced push to this branch for build to succeed. |
Gradle Check (Jenkins) Run Completed with:
|
…translog (#10817) --------- Signed-off-by: Gaurav Bafna <[email protected]> (cherry picked from commit 218a2ef) Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
…translog (#10817) (#10850) --------- (cherry picked from commit 218a2ef) Signed-off-by: Gaurav Bafna <[email protected]> Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
…translog (opensearch-project#10817) --------- Signed-off-by: Gaurav Bafna <[email protected]> Signed-off-by: Shivansh Arora <[email protected]>
Description
Even after #9191 , we are seeing recovery failures on primary relocations.
This is due to the fact that older primary is continuously uploading and deleting from remote translog. So it can happen that even with retries , newer primary is not able to complete the download of all the files.
This PR adds some buffer before deleting the files from remote translog .
Related Issues
Resolves #[Issue number to be closed when this PR is merged]
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.