-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
remove locks when finishing a job #1637
Conversation
also: add the description in readme, and fix test value
ArgoCD Diff for commit
|
Legend | Status |
---|---|
✅ | The app is synced in ArgoCD, and diffs you see are solely from this PR. |
The app is out-of-sync in ArgoCD, and the diffs you see include those changes plus any from this PR. | |
🛑 | There was an error generating the ArgoCD diffs due to changes in this PR. |
Codecov ReportPatch coverage:
Additional details and impacted files@@ Coverage Diff @@
## main #1637 +/- ##
==========================================
- Coverage 99.37% 92.24% -7.14%
==========================================
Files 6 77 +71
Lines 161 5464 +5303
==========================================
+ Hits 160 5040 +4880
- Misses 1 424 +423
Flags with carried forward coverage won't be shown. Click here to find out more.
☔ View full report in Codecov by Sentry. |
should fix the issue with old remaining locks, when a job is killed (too long job, after 40 minutes) while it's uploading files to the Hub (lock created with git_branch()).
research notes: to get all the old remaining locks:
Today: returns 64 entries, eg:
They have a non-null owner, and have been created with lock.git_branch() (ie: during parquet or duckdb index upload). Possibly: when a pod was killed (sync? zombie killer?)
|
I'll manually remove the owner from the existing old lock entries after deploying with:
|
|
should fix the issue with old remaining locks, when a job is killed (too long job, after 40 minutes) while it's uploading files to the Hub (lock created with git_branch()).
also: add environment variables in docker compose and helm, add the description in readme, and fix test value