-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fail fast when builtinbackup fails to restore a single file #16856
Conversation
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Review ChecklistHello reviewers! 👋 Please follow this checklist when reviewing this Pull Request. General
Tests
Documentation
New flags
If a workflow is added or modified:
Backward compatibility
|
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #16856 +/- ##
==========================================
+ Coverage 69.43% 69.45% +0.01%
==========================================
Files 1571 1571
Lines 203021 203099 +78
==========================================
+ Hits 140970 141056 +86
+ Misses 62051 62043 -8 ☔ View full report in Codecov by Sentry. |
Signed-off-by: Florent Poinsard <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it not possible to write an e2e or unit test for this? If not for some reason, then at least we have the manual test. That's my only real concern, the other things are minor nits/suggestions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving now that we discussed future testing plans/additions. So we don’t have to do that here. Everything else was nits.
Co-authored-by: Matt Lord <[email protected]> Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agree with Matt that we need an e2e test for this. However, since
- existing tests are passing, i.e. the PR is not causing a regression that we can see
- manual test is passing
I'll go ahead and approve it.
Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Signed-off-by: Florent Poinsard <[email protected]>
… file (#16856) (#16868) Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]>
… file (#16856) (#16867) Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Co-authored-by: Florent Poinsard <[email protected]>
… file (#16856) (#16866) Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Co-authored-by: Florent Poinsard <[email protected]>
* [release-19.0] Bump to `v19.0.5-SNAPSHOT` after the `v19.0.4` release (#15889) Signed-off-by: Andres Taylor <[email protected]> * [release-19.0] fix: handle info_schema routing (#15899) (#15906) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Update VTAdmin build script (#15839) (#15850) Signed-off-by: notfelineit <[email protected]> Signed-off-by: <> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Frances Thai <[email protected]> * [release-19.0] Update env.sh so that is does not error when running on Mac (#15835) (#15915) Signed-off-by: bddicken <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] fix: derived table join column expression to be part of add join predicate on rewrite (#15956) (#15960) Signed-off-by: Harshit Gangal <[email protected]> Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Harshit Gangal <[email protected]> Co-authored-by: Andres Taylor <[email protected]> * [release-19.0] fix: insert on duplicate update to add list argument in the bind variables map (#15961) (#15967) Signed-off-by: Harshit Gangal <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Harshit Gangal <[email protected]> * [release-19.0] test: Cleaner plan tests output (#15922) (#15964) Signed-off-by: Andres Taylor <[email protected]> * [release-19.0] connpool: Allow time out during shutdown (#15979) (#16003) Signed-off-by: Vicent Marti <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] fix: remove keyspace when merging subqueries (#16019) (#16027) Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Add DCO workflow (#16052) (#16056) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Upgrade the Golang version to `go1.22.4` (#16061) Signed-off-by: GitHub <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: frouioui <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Remove DCO workaround (#16087) (#16091) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Do not load table stats when booting `vttablet`. (#15715) (#16100) Signed-off-by: Arthur Schreiber <[email protected]> Co-authored-by: Arthur Schreiber <[email protected]> * [release-19.0] Add timeout to all the contexts used for RPC calls in vtorc (#15991) (#16103) Signed-off-by: Manan Gupta <[email protected]> * [release-19.0] Update braces package (#16115) (#16118) Signed-off-by: Frances Thai <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] fix: order by subquery planning (#16049) (#16132) Co-authored-by: Harshit Gangal <[email protected]> Co-authored-by: Andres Taylor <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Fix `vtexplain` not handling `UNION` queries with `weight_string` results correctly. (#16129) (#16157) Signed-off-by: Arthur Schreiber <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Arthur Schreiber <[email protected]> * Run more test on release-19 branch (#16152) Signed-off-by: Harshit Gangal <[email protected]> * [release-19.0] Fix flakiness in `vtexplain` unit test case. (#16159) (#16167) Signed-off-by: Arthur Schreiber <[email protected]> Co-authored-by: Arthur Schreiber <[email protected]> * [release-19.0] Online DDL shadow table: rename referenced table name in self referencing FK (#16205) (#16207) Signed-off-by: Shlomi Noach <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Fix flaky tests that use vtcombo (#16178) (#16212) Signed-off-by: Manan Gupta <[email protected]> Co-authored-by: Manan Gupta <[email protected]> Co-authored-by: Manan Gupta <[email protected]> * [release-19.0] Handle Nullability for Columns from Outer Tables (#16174) (#16185) Co-authored-by: Andrés Taylor <[email protected]> * [release-19.0] VDiff CLI: Fix VDiff `show` bug (#16177) (#16198) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] VReplication Workflow: set state correctly when restarting workflow streams in the copy phase (#16217) (#16222) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> * [release-19.0] vtctldclient: Apply (Shard | Keyspace| Table) Routing Rules commands don't work (#16096) (#16124) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> * [release-19.0] Fix vtgate crash in group concat (#16254) Signed-off-by: Manan Gupta <[email protected]> * [release-19.0] Fix Incorrect Optimization with LIMIT and GROUP BY (#16263) (#16267) Signed-off-by: Andres Taylor <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Andres Taylor <[email protected]> * [release-19.0] Fix the `v19.0.0` release notes and use the `vitess/lite` image for the MySQL container (#16282) (#16285) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] VReplication: Properly handle target shards w/o a primary in Reshard (#16283) (#16291) Signed-off-by: Matt Lord <[email protected]> Co-authored-by: Matt Lord <[email protected]> * [release-19.0] CI: Fix for xtrabackup install failures (#16329) (#16332) Signed-off-by: Matt Lord <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <[email protected]> * [release-19.0] Upgrade the Golang version to `go1.22.5` (#16322) Signed-off-by: GitHub <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: frouioui <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Fix the install dependencies script in Docker (#16340) (#16346) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] planner: Handle ORDER BY inside derived tables (#16353) (#16359) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Andres Taylor <[email protected]> * [release-19.0] Fix Join Predicate Cleanup Bug in Route Merging (#16386) (#16389) Signed-off-by: Andres Taylor <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Andres Taylor <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] fix issue with aggregation inside of derived tables (#16366) (#16384) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> * [release-19.0] Use default schema reload config values when config file is empty (#16393) (#16410) Signed-off-by: Matt Lord <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Fix subquery planning having an aggregation that is used in order by as long as we can merge it all into a single route (#16402) (#16407) Signed-off-by: Manan Gupta <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Fix panic in schema tracker in presence of keyspace routing rules (#16383) (#16406) Signed-off-by: Manan Gupta <[email protected]> * [release-19] Vitess tester workflow (#16127) (#16418) Signed-off-by: Manan Gupta <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] feat: add a LIMIT 1 on EXISTS subqueries to limit network overhead (#16153) (#16191) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> * [release-19.0] Code Freeze for `v19.0.5` (#16448) Signed-off-by: Andres Taylor <[email protected]> * [release-19.0] Release of `v19.0.5` (#16450) Signed-off-by: Andres Taylor <[email protected]> * [release-19.0] Bump to `v19.0.6-SNAPSHOT` after the `v19.0.5` release (#16456) Signed-off-by: Andres Taylor <[email protected]> * [release-19.0] fix: reference table join merge (#16488) (#16496) Signed-off-by: Harshit Gangal <[email protected]> Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Harshit Gangal <[email protected]> Co-authored-by: Andres Taylor <[email protected]> * [release-19.0] Improve the queries upgrade/downgrade CI workflow by using same test code version as binary (#16494) (#16501) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] bugfix: don't treat join predicates as filter predicates (#16472) (#16474) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> * [release-19.0] VTAdmin: Upgrade websockets js package (#16504) (#16512) Signed-off-by: Matt Lord <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <[email protected]> * [release-19.0] bugfix: Allow cross-keyspace joins (#16520) (#16523) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> * [release-19.0] simplify merging logic (#16525) (#16532) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Fix: Offset planning in hash joins (#16540) (#16551) Signed-off-by: Manan Gupta <[email protected]> Co-authored-by: Manan Gupta <[email protected]> Co-authored-by: Manan Gupta <[email protected]> * [release-19.0] Fix `RemoveTablet` during `TabletExternallyReparented` causing connection issues (#16371) (#16567) Signed-off-by: Arthur Schreiber <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * v19 backport: Throttler/vreplication: fix app name used by VPlayer (#16578) (#16580) Signed-off-by: Shlomi Noach <[email protected]> * [release-19.0] Upgrade the Golang version to `go1.22.6` (#16543) Signed-off-by: GitHub <[email protected]> Signed-off-by: Shlomi Noach <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: frouioui <[email protected]> Co-authored-by: Shlomi Noach <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * v19 backport: Online DDL: avoid SQL's `CONVERT(...)`, convert programmatically if needed (#16603) Signed-off-by: Shlomi Noach <[email protected]> * [release-19.0] Remove mysql57/percona57 bootstrap images (#16620) (#16622) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Fix query plan cache misses metric (#16562) (#16627) Signed-off-by: shanth96 <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] VReplication workflows: retry "wrong tablet type" errors (#16645) (#16652) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> * [release-19.0] VStream API: validate that last PK has fields defined (#16478) (#16486) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Rohit Nayak <[email protected]> * [release-19.0] Update micromatch to 4.0.8 (#16660) (#16666) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Replace ErrorContains checks with Error checks before running upgrade downgrade (#16700) Signed-off-by: Manan Gupta <[email protected]> * [release-19.0] JSON Encoding: Use Type_RAW for marshalling json (#16637) (#16681) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Rohit Nayak <[email protected]> * [release-19.0] FindErrantGTIDs: superset is not an errant GTID situation (#16725) (#16728) Signed-off-by: deepthi <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Move from 4-cores larger runners to `ubuntu-latest` (#16714) (#16717) Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Upgrade the Golang version to `go1.22.7` (#16721) Signed-off-by: GitHub <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: frouioui <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Code Freeze for `v19.0.6` (#16745) Signed-off-by: Rohit Nayak <[email protected]> * [release-19.0] Release of `v19.0.6` (#16747) Signed-off-by: Rohit Nayak <[email protected]> * [release-19.0] Bump to `v19.0.7-SNAPSHOT` after the `v19.0.6` release (#16753) Signed-off-by: Rohit Nayak <[email protected]> * [release-19.0] Remove mysql57 from docker images (#16763) Signed-off-by: Florent Poinsard <[email protected]> * [release-19.0] VTAdmin: Address security vuln in path-to-regexp node pkg (#16770) (#16772) Signed-off-by: Matt Lord <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <[email protected]> * Backport: Fix ACL checks for CTEs (#16642) (#16776) Signed-off-by: Manan Gupta <[email protected]> Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Manan Gupta <[email protected]> * [release-19.0] VTAdmin: Fix serve-handler's path-to-regexp dep and add default schema refresh (#16778) (#16783) Signed-off-by: Matt Lord <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <[email protected]> * [release-19.0] Bump com.google.protobuf:protobuf-java from 3.24.3 to 3.25.5 in /java (#16809) (#16837) Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * [release-19.0] VTAdmin: Upgrade deps to address security vulns (#16843) (#16846) Signed-off-by: Matt Lord <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <[email protected]> * [release-19.0] Support passing filters to `discovery.NewHealthCheck(...)` (#16170) (#16871) Signed-off-by: Tim Vaillancourt <[email protected]> * [release-19.0] Fail fast when builtinbackup fails to restore a single file (#16856) (#16867) Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Upgrade Golang to 1.22.8 (#16895) Signed-off-by: Florent Poinsard <[email protected]> * [release-19.0] VTTablet: smartconnpool: notify all expired waiters (#16897) (#16901) Signed-off-by: Brendan Dougherty <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Fix race in `replicationLagModule` of `go/vt/throttle` (#16078) (#16899) Signed-off-by: Tim Vaillancourt <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Tim Vaillancourt <[email protected]> * [release-19.0] Bump commons-io:commons-io from 2.7 to 2.14.0 in /java (#16889) (#16930) Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * [release-19.0] fixes bugs around expression precedence and LIKE (#16934 & #16649) (#16945) Signed-off-by: Andres Taylor <[email protected]> Signed-off-by: Manan Gupta <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> Co-authored-by: Manan Gupta <[email protected]> Co-authored-by: Manan Gupta <[email protected]> * [release-19.0] Flaky test fixes (#16940) (#16958) Signed-off-by: Manan Gupta <[email protected]> Co-authored-by: Manan Gupta <[email protected]> * [release-19.0] fix: route engine to handle column truncation for execute after lookup (#16981) (#16984) Signed-off-by: Harshit Gangal <[email protected]> Co-authored-by: Harshit Gangal <[email protected]> * [release-19.0] bugfix: add HAVING columns inside derived tables (#16976) (#16978) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> * [release-19.0] Fix deadlock between health check and topology watcher (#16995) (#17008) Signed-off-by: Manan Gupta <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] Add support for `MultiEqual` opcode for lookup vindexes. (#16975) (#17039) Signed-off-by: Arthur Schreiber <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> * [release-19.0] bugfix: treat EXPLAIN like SELECT (#17054) (#17056) Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> * [release-19.0] Delegate Column Availability Checks to MySQL for Single-Route Queries (#17077) (#17085) Signed-off-by: Harshit Gangal <[email protected]> Signed-off-by: Andres Taylor <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Andres Taylor <[email protected]> Co-authored-by: Harshit Gangal <[email protected]> * Bugfix for Panic on Joined Queries with Non-Authoritative Tables in Vitess 19.0 (#17103) Signed-off-by: Andres Taylor <[email protected]> * [release-19.0] Improve Schema Engine's TablesWithSize80 query (#17066) (#17089) Signed-off-by: Shlomi Noach <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Shlomi Noach <[email protected]> * [release-19.0] Fix unreachable errors when taking a backup (#17062) (#17110) Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> * [release-19.0] Code Freeze for `v19.0.7` (#17148) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> * [release-19.0] Release of `v19.0.7` (#17149) Signed-off-by: Rohit Nayak <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> * restore test conditional for v18 vttablet Signed-off-by: Tim Vaillancourt <[email protected]> * restore more test conditional for v18 binaries Signed-off-by: Tim Vaillancourt <[email protected]> * restore whitespace Signed-off-by: Tim Vaillancourt <[email protected]> * Revert "[release-19.0] Improve the queries upgrade/downgrade CI workflow by using same test code version as binary (#16494) (#16501)" This reverts commit 25a80ac. * add missing table from cleanup Signed-off-by: Tim Vaillancourt <[email protected]> --------- Signed-off-by: Andres Taylor <[email protected]> Signed-off-by: notfelineit <[email protected]> Signed-off-by: <> Signed-off-by: bddicken <[email protected]> Signed-off-by: Harshit Gangal <[email protected]> Signed-off-by: Vicent Marti <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: GitHub <[email protected]> Signed-off-by: Arthur Schreiber <[email protected]> Signed-off-by: Manan Gupta <[email protected]> Signed-off-by: Frances Thai <[email protected]> Signed-off-by: Shlomi Noach <[email protected]> Signed-off-by: Rohit Nayak <[email protected]> Signed-off-by: Florent Poinsard <[email protected]> Signed-off-by: Matt Lord <[email protected]> Signed-off-by: shanth96 <[email protected]> Signed-off-by: deepthi <[email protected]> Signed-off-by: dependabot[bot] <[email protected]> Signed-off-by: Tim Vaillancourt <[email protected]> Signed-off-by: Brendan Dougherty <[email protected]> Co-authored-by: Andrés Taylor <[email protected]> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Frances Thai <[email protected]> Co-authored-by: Harshit Gangal <[email protected]> Co-authored-by: vitess-bot <[email protected]> Co-authored-by: frouioui <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Arthur Schreiber <[email protected]> Co-authored-by: Manan Gupta <[email protected]> Co-authored-by: Manan Gupta <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> Co-authored-by: Florent Poinsard <[email protected]> Co-authored-by: Matt Lord <[email protected]> Co-authored-by: Shlomi Noach <[email protected]> Co-authored-by: Rohit Nayak <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Description
This PR fixes #16855. Since each file is restored concurrently, we need to cancel all the restore at once as soon as a goroutine fails. This way we prevent the restore process to take forever and stall.
I do not have a programatic E2E test for this PR as it was way to flaky even locally. However, I have described a step-by-step reproduction process on #16855.
This is a bug all the way from 18 to main, backporting to all branches.
Logs
Before
On the first line, we detect the error. But we continue restoring all the ongoing file after that, without canceling them.
After
Here we can see that current/ongoing restore are getting canceled. Besides file 72, which I think is due to concurrency, we log the error (first line) from the stderr of
ztsd
but there is enough time until we cancel all the context for72
to begin and finish. But once we detect the error at the top level loop inrestoreFiles
, all executions are canceled.