You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Lagoon has created backup schedules for two different environments of the same project. Here is the diff showing an identical schedule in two different namespaces:
These schedules cause two checks to run at the same time (one in each namespace). This sometimes works if the repository is small (so the check is quick) but often doesn't work because restic takes an exclusive lock on the repository during a check. So if the check that wins the race to get a lock on the repository takes longer than the retry time of the other check jobs (which seems to be 5x over ~2 minutes) the other checks always fail.
Maybe Lagoon should only add a check schedule to a single environment (the first production env in the project?). And then prune on the first of each of development and production since they can have differing policies.
Somehow ensure that check and prune for each env runs at different times. Though it seems difficult and pointless since the command only needs to run once per repository?
Lagoon maintains a special namespace per-project or per-cluster which has a Schedule for each repository containing the prod and dev check and prune schedules?
Something else???
The text was updated successfully, but these errors were encountered:
Lagoon has created backup schedules for two different environments of the same project. Here is the diff showing an identical schedule in two different namespaces:
These schedules cause two checks to run at the same time (one in each namespace). This sometimes works if the repository is small (so the check is quick) but often doesn't work because restic takes an exclusive lock on the repository during a check. So if the check that wins the race to get a lock on the repository takes longer than the retry time of the other check jobs (which seems to be 5x over ~2 minutes) the other checks always fail.
The same problem exists with
prune
schedules because that command also takes an exclusive lock.Backups are not affected because that command only take an append lock.
Ideas for solving the issue:
check
schedule to a single environment (the first production env in the project?). And thenprune
on the first of each ofdevelopment
andproduction
since they can have differing policies.check
andprune
for each env runs at different times. Though it seems difficult and pointless since the command only needs to run once per repository?Schedule
for each repository containing the prod and devcheck
andprune
schedules?The text was updated successfully, but these errors were encountered: