Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fail fast when builtinbackup fails to restore a single file #16856

Merged
merged 5 commits into from
Sep 30, 2024

Conversation

frouioui
Copy link
Member

@frouioui frouioui commented Sep 26, 2024

Description

This PR fixes #16855. Since each file is restored concurrently, we need to cancel all the restore at once as soon as a goroutine fails. This way we prevent the restore process to take forever and stall.

I do not have a programatic E2E test for this PR as it was way to flaky even locally. However, I have described a step-by-step reproduction process on #16855.

This is a bug all the way from 18 to main, backporting to all branches.

Logs

Before

On the first line, we detect the error. But we continue restoring all the ongoing file after that, without canceling them.

commerce/0 (zone1-0000000102): time:{seconds:1727389202 nanoseconds:504998000} file:"backup.go" line:519 value:"decompressor stderr: /*stdin*\\ : Read error (39) : premature end "
commerce/0 (zone1-0000000102): time:{seconds:1727389202 nanoseconds:524027000} file:"builtinbackupengine.go" line:1145 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727389202 nanoseconds:524051000} file:"builtinbackupengine.go" line:779 value:"Completed restoring  \"173\""
commerce/0 (zone1-0000000102): time:{seconds:1727389202 nanoseconds:564330000} file:"builtinbackupengine.go" line:1145 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727389202 nanoseconds:893811000} file:"builtinbackupengine.go" line:784 value:"restoring  \"34\": 1421.60kb"
commerce/0 (zone1-0000000102): time:{seconds:1727389202 nanoseconds:993136000} file:"builtinbackupengine.go" line:784 value:"restoring  \"171\": 2780.91kb"
commerce/0 (zone1-0000000102): time:{seconds:1727389202 nanoseconds:994575000} file:"builtinbackupengine.go" line:784 value:"restoring  \"172\": 252.01kb"
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:393780000} file:"builtinbackupengine.go" line:784 value:"restoring  \"34\": 1421.60kb"
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:493087000} file:"builtinbackupengine.go" line:784 value:"restoring  \"171\": 2780.91kb"
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:494524000} file:"builtinbackupengine.go" line:784 value:"restoring  \"172\": 252.01kb"
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:601967000} file:"builtinbackupengine.go" line:779 value:"Completed restoring  \"171\""
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:601978000} file:"builtinbackupengine.go" line:1145 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:804764000} file:"builtinbackupengine.go" line:1145 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:804770000} file:"builtinbackupengine.go" line:779 value:"Completed restoring  \"34\""
commerce/0 (zone1-0000000102): time:{seconds:1727389203 nanoseconds:805104000} file:"restore.go" line:283 value:"Restore: got a restore manifest: <nil>, err=can't restore file 172 to vt_commerce/customer.ibd: hash mismatch for vt_commerce/customer.ibd, got 0ca874fd expected ea1dd5df\nfailed to restore files, waitForBackupInterval=0s"
E0926 16:20:03.818039   37724 main.go:56] rpc error: code = Unknown desc = TabletManager.RestoreFromBackup on zone1-0000000102: Can't restore backup: failed to restore files: can't restore file 172 to vt_commerce/customer.ibd: hash mismatch for vt_commerce/customer.ibd, got 0ca874fd expected ea1dd5df

After

Here we can see that current/ongoing restore are getting canceled. Besides file 72, which I think is due to concurrency, we log the error (first line) from the stderr of ztsd but there is enough time until we cancel all the context for 72 to begin and finish. But once we detect the error at the top level loop in restoreFiles, all executions are canceled.

commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:213399000} file:"backup.go" line:519 value:"decompressor stderr: /*stdin*\\ : Read error (39) : premature end "
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:221890000} file:"builtinbackupengine.go" line:1171 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:221926000} file:"builtinbackupengine.go" line:795 value:"Completed restoring  \"162\""
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:222180000} file:"builtinbackupengine.go" line:1077 value:"Copying file 76: performance_schema/events_stages_su_116.sdi"
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:222465000} file:"compression.go" line:168 value:"Decompressing using external command: \"nice -n 19 zstd -c -d\""
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:232724000} file:"builtinbackupengine.go" line:1171 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:232753000} file:"builtinbackupengine.go" line:795 value:"Completed restoring  \"76\""
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:274510000} file:"builtinbackupengine.go" line:1171 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:274759000} file:"builtinbackupengine.go" line:792 value:"Canceled \"171\""
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:274767000} file:"builtinbackupengine.go" line:792 value:"Canceled \"34\""
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:274789000} file:"builtinbackupengine.go" line:792 value:"Canceled \"172\""
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:279935000} file:"builtinbackupengine.go" line:1171 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:280043000} file:"builtinbackupengine.go" line:1171 value:"closing decompressor"
commerce/0 (zone1-0000000102): time:{seconds:1727391201 nanoseconds:280295000} file:"restore.go" line:283 value:"Restore: got a restore manifest: <nil>, err=can't restore file 172 to vt_commerce/customer.ibd: hash mismatch for vt_commerce/customer.ibd, got 048f1e72 expected 6e9641a7;can't restore file 171 to vt_commerce/corder.ibd: hash mismatch for vt_commerce/corder.ibd, got aaffb3c0 expected 2d504b92\nfailed to restore files, waitForBackupInterval=0s"
E0926 16:53:21.293970   73953 main.go:56] rpc error: code = Unknown desc = TabletManager.RestoreFromBackup on zone1-0000000102: Can't restore backup: failed to restore files: can't restore file 172 to vt_commerce/customer.ibd: hash mismatch for vt_commerce/customer.ibd, got 048f1e72 expected 6e9641a7;can't restore file 171 to vt_commerce/corder.ibd: hash mismatch for vt_commerce/corder.ibd, got aaffb3c0 expected 2d504b92

Copy link
Contributor

vitess-bot bot commented Sep 26, 2024

Review Checklist

Hello reviewers! 👋 Please follow this checklist when reviewing this Pull Request.

General

  • Ensure that the Pull Request has a descriptive title.
  • Ensure there is a link to an issue (except for internal cleanup and flaky test fixes), new features should have an RFC that documents use cases and test cases.

Tests

  • Bug fixes should have at least one unit or end-to-end test, enhancement and new features should have a sufficient number of tests.

Documentation

  • Apply the release notes (needs details) label if users need to know about this change.
  • New features should be documented.
  • There should be some code comments as to why things are implemented the way they are.
  • There should be a comment at the top of each new or modified test to explain what the test does.

New flags

  • Is this flag really necessary?
  • Flag names must be clear and intuitive, use dashes (-), and have a clear help text.

If a workflow is added or modified:

  • Each item in Jobs should be named in order to mark it as required.
  • If the workflow needs to be marked as required, the maintainer team must be notified.

Backward compatibility

  • Protobuf changes should be wire-compatible.
  • Changes to _vt tables and RPCs need to be backward compatible.
  • RPC changes should be compatible with vitess-operator
  • If a flag is removed, then it should also be removed from vitess-operator and arewefastyet, if used there.
  • vtctl command output order should be stable and awk-able.

@vitess-bot vitess-bot bot added NeedsBackportReason If backport labels have been applied to a PR, a justification is required NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsIssue A linked issue is missing for this Pull Request NeedsWebsiteDocsUpdate What it says labels Sep 26, 2024
@frouioui frouioui added Type: Bug Component: Backup and Restore and removed NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsWebsiteDocsUpdate What it says NeedsIssue A linked issue is missing for this Pull Request NeedsBackportReason If backport labels have been applied to a PR, a justification is required labels Sep 26, 2024
@github-actions github-actions bot added this to the v21.0.0 milestone Sep 26, 2024
@frouioui frouioui added Backport to: release-18.0 Backport to: release-19.0 Needs to be back ported to release-19.0 Backport to: release-20.0 Needs to be backport to release-20.0 labels Sep 26, 2024
Copy link

codecov bot commented Sep 26, 2024

Codecov Report

Attention: Patch coverage is 71.87500% with 9 lines in your changes missing coverage. Please review.

Project coverage is 69.45%. Comparing base (2e2b223) to head (a107193).
Report is 6 commits behind head on main.

Files with missing lines Patch % Lines
go/vt/mysqlctl/builtinbackupengine.go 71.87% 9 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main   #16856      +/-   ##
==========================================
+ Coverage   69.43%   69.45%   +0.01%     
==========================================
  Files        1571     1571              
  Lines      203021   203099      +78     
==========================================
+ Hits       140970   141056      +86     
+ Misses      62051    62043       -8     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Signed-off-by: Florent Poinsard <[email protected]>
Copy link
Contributor

@mattlord mattlord left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it not possible to write an e2e or unit test for this? If not for some reason, then at least we have the manual test. That's my only real concern, the other things are minor nits/suggestions.

go/vt/mysqlctl/builtinbackupengine.go Outdated Show resolved Hide resolved
go/vt/mysqlctl/builtinbackupengine.go Outdated Show resolved Hide resolved
go/vt/mysqlctl/builtinbackupengine.go Outdated Show resolved Hide resolved
Copy link
Contributor

@mattlord mattlord left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approving now that we discussed future testing plans/additions. So we don’t have to do that here. Everything else was nits.

frouioui and others added 2 commits September 30, 2024 02:16
Co-authored-by: Matt Lord <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Copy link
Member

@deepthi deepthi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agree with Matt that we need an e2e test for this. However, since

  • existing tests are passing, i.e. the PR is not causing a regression that we can see
  • manual test is passing

I'll go ahead and approve it.

@frouioui frouioui merged commit 5484439 into vitessio:main Sep 30, 2024
98 checks passed
@frouioui frouioui deleted the fix-restore-backup-error branch September 30, 2024 21:33
frouioui added a commit that referenced this pull request Oct 2, 2024
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
frouioui added a commit that referenced this pull request Oct 2, 2024
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
frouioui added a commit that referenced this pull request Oct 2, 2024
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
rohit-nayak-ps pushed a commit that referenced this pull request Oct 4, 2024
… file (#16856) (#16868)

Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
rohit-nayak-ps pushed a commit that referenced this pull request Oct 4, 2024
… file (#16856) (#16867)

Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
rohit-nayak-ps pushed a commit that referenced this pull request Oct 4, 2024
… file (#16856) (#16866)

Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
timvaillancourt referenced this pull request in slackhq/vitess Nov 7, 2024
* [release-19.0] Bump to `v19.0.5-SNAPSHOT` after the `v19.0.4` release (#15889)

Signed-off-by: Andres Taylor <[email protected]>

* [release-19.0] fix: handle info_schema routing (#15899) (#15906)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Update VTAdmin build script (#15839) (#15850)

Signed-off-by: notfelineit <[email protected]>
Signed-off-by: <>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Frances Thai <[email protected]>

* [release-19.0] Update env.sh so that is does not error when running on Mac (#15835) (#15915)

Signed-off-by: bddicken <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] fix: derived table join column expression to be part of add join predicate on rewrite (#15956) (#15960)

Signed-off-by: Harshit Gangal <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Harshit Gangal <[email protected]>
Co-authored-by: Andres Taylor <[email protected]>

* [release-19.0] fix: insert on duplicate update to add list argument in the bind variables map (#15961) (#15967)

Signed-off-by: Harshit Gangal <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Harshit Gangal <[email protected]>

* [release-19.0] test: Cleaner plan tests output (#15922) (#15964)

Signed-off-by: Andres Taylor <[email protected]>

* [release-19.0] connpool: Allow time out during shutdown (#15979) (#16003)

Signed-off-by: Vicent Marti <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] fix: remove keyspace when merging subqueries (#16019) (#16027)

Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Add DCO workflow (#16052) (#16056)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Upgrade the Golang version to `go1.22.4` (#16061)

Signed-off-by: GitHub <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: frouioui <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Remove DCO workaround (#16087) (#16091)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Do not load table stats when booting `vttablet`. (#15715) (#16100)

Signed-off-by: Arthur Schreiber <[email protected]>
Co-authored-by: Arthur Schreiber <[email protected]>

* [release-19.0] Add timeout to all the contexts used for RPC calls in vtorc (#15991) (#16103)

Signed-off-by: Manan Gupta <[email protected]>

* [release-19.0] Update braces package (#16115) (#16118)

Signed-off-by: Frances Thai <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] fix: order by subquery planning (#16049) (#16132)

Co-authored-by: Harshit Gangal <[email protected]>
Co-authored-by: Andres Taylor <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Fix `vtexplain` not handling `UNION` queries with `weight_string` results correctly. (#16129) (#16157)

Signed-off-by: Arthur Schreiber <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Arthur Schreiber <[email protected]>

* Run more test on release-19 branch (#16152)

Signed-off-by: Harshit Gangal <[email protected]>

* [release-19.0] Fix flakiness in `vtexplain` unit test case. (#16159) (#16167)

Signed-off-by: Arthur Schreiber <[email protected]>
Co-authored-by: Arthur Schreiber <[email protected]>

* [release-19.0] Online DDL shadow table: rename referenced table name in self referencing FK (#16205) (#16207)

Signed-off-by: Shlomi Noach <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Fix flaky tests that use vtcombo (#16178) (#16212)

Signed-off-by: Manan Gupta <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>

* [release-19.0] Handle Nullability for Columns from Outer Tables (#16174) (#16185)

Co-authored-by: Andrés Taylor <[email protected]>

* [release-19.0] VDiff CLI: Fix VDiff `show` bug (#16177) (#16198)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] VReplication Workflow: set state correctly when restarting workflow streams in the copy phase (#16217) (#16222)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>

* [release-19.0] vtctldclient: Apply (Shard | Keyspace| Table) Routing Rules commands don't work (#16096) (#16124)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>

* [release-19.0] Fix vtgate crash in group concat (#16254)

Signed-off-by: Manan Gupta <[email protected]>

* [release-19.0] Fix Incorrect Optimization with LIMIT and GROUP BY (#16263) (#16267)

Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Andres Taylor <[email protected]>

* [release-19.0] Fix the `v19.0.0` release notes and use the `vitess/lite` image for the MySQL container (#16282) (#16285)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] VReplication: Properly handle target shards w/o a primary in Reshard (#16283) (#16291)

Signed-off-by: Matt Lord <[email protected]>
Co-authored-by: Matt Lord <[email protected]>

* [release-19.0] CI: Fix for xtrabackup install failures (#16329) (#16332)

Signed-off-by: Matt Lord <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Matt Lord <[email protected]>

* [release-19.0] Upgrade the Golang version to `go1.22.5` (#16322)

Signed-off-by: GitHub <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: frouioui <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Fix the install dependencies script in Docker (#16340) (#16346)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] planner: Handle ORDER BY inside derived tables (#16353) (#16359)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Andres Taylor <[email protected]>

* [release-19.0] Fix Join Predicate Cleanup Bug in Route Merging (#16386) (#16389)

Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Andres Taylor <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] fix issue with aggregation inside of derived tables (#16366) (#16384)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>

* [release-19.0] Use default schema reload config values when config file is empty (#16393) (#16410)

Signed-off-by: Matt Lord <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Fix subquery planning having an aggregation that is used in order by as long as we can merge it all into a single route (#16402) (#16407)

Signed-off-by: Manan Gupta <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Fix panic in schema tracker in presence of keyspace routing rules (#16383) (#16406)

Signed-off-by: Manan Gupta <[email protected]>

* [release-19] Vitess tester workflow (#16127) (#16418)

Signed-off-by: Manan Gupta <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] feat: add a LIMIT 1 on EXISTS subqueries to limit network overhead (#16153) (#16191)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>

* [release-19.0] Code Freeze for `v19.0.5` (#16448)

Signed-off-by: Andres Taylor <[email protected]>

* [release-19.0] Release of `v19.0.5` (#16450)

Signed-off-by: Andres Taylor <[email protected]>

* [release-19.0] Bump to `v19.0.6-SNAPSHOT` after the `v19.0.5` release (#16456)

Signed-off-by: Andres Taylor <[email protected]>

* [release-19.0] fix: reference table join merge (#16488) (#16496)

Signed-off-by: Harshit Gangal <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Harshit Gangal <[email protected]>
Co-authored-by: Andres Taylor <[email protected]>

* [release-19.0] Improve the queries upgrade/downgrade CI workflow by using same test code version as binary (#16494) (#16501)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] bugfix: don't treat join predicates as filter predicates (#16472) (#16474)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>

* [release-19.0] VTAdmin: Upgrade websockets js package (#16504) (#16512)

Signed-off-by: Matt Lord <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Matt Lord <[email protected]>

* [release-19.0] bugfix: Allow cross-keyspace joins (#16520) (#16523)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>

* [release-19.0] simplify merging logic (#16525) (#16532)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Fix: Offset planning in hash joins (#16540) (#16551)

Signed-off-by: Manan Gupta <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>

* [release-19.0] Fix `RemoveTablet` during `TabletExternallyReparented` causing connection issues (#16371) (#16567)

Signed-off-by: Arthur Schreiber <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* v19 backport: Throttler/vreplication: fix app name used by VPlayer (#16578) (#16580)

Signed-off-by: Shlomi Noach <[email protected]>

* [release-19.0] Upgrade the Golang version to `go1.22.6` (#16543)

Signed-off-by: GitHub <[email protected]>
Signed-off-by: Shlomi Noach <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: frouioui <[email protected]>
Co-authored-by: Shlomi Noach <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* v19 backport: Online DDL: avoid SQL's `CONVERT(...)`, convert programmatically if needed (#16603)

Signed-off-by: Shlomi Noach <[email protected]>

* [release-19.0] Remove mysql57/percona57 bootstrap images (#16620) (#16622)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Fix query plan cache misses metric (#16562) (#16627)

Signed-off-by: shanth96 <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] VReplication workflows: retry "wrong tablet type" errors (#16645) (#16652)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>

* [release-19.0] VStream API: validate that last PK has fields defined (#16478) (#16486)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Rohit Nayak <[email protected]>

* [release-19.0] Update micromatch to 4.0.8 (#16660) (#16666)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Replace ErrorContains checks with Error checks before running upgrade downgrade (#16700)

Signed-off-by: Manan Gupta <[email protected]>

* [release-19.0] JSON Encoding: Use Type_RAW for marshalling json (#16637) (#16681)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Rohit Nayak <[email protected]>

* [release-19.0] FindErrantGTIDs: superset is not an errant GTID situation (#16725) (#16728)

Signed-off-by: deepthi <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Move from 4-cores larger runners to `ubuntu-latest` (#16714) (#16717)

Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Upgrade the Golang version to `go1.22.7` (#16721)

Signed-off-by: GitHub <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: frouioui <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Code Freeze for `v19.0.6` (#16745)

Signed-off-by: Rohit Nayak <[email protected]>

* [release-19.0] Release of `v19.0.6` (#16747)

Signed-off-by: Rohit Nayak <[email protected]>

* [release-19.0] Bump to `v19.0.7-SNAPSHOT` after the `v19.0.6` release (#16753)

Signed-off-by: Rohit Nayak <[email protected]>

* [release-19.0] Remove mysql57 from docker images (#16763)

Signed-off-by: Florent Poinsard <[email protected]>

* [release-19.0] VTAdmin: Address security vuln in path-to-regexp node pkg (#16770) (#16772)

Signed-off-by: Matt Lord <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Matt Lord <[email protected]>

* Backport: Fix ACL checks for CTEs (#16642) (#16776)

Signed-off-by: Manan Gupta <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>

* [release-19.0] VTAdmin: Fix serve-handler's path-to-regexp dep and add default schema refresh (#16778) (#16783)

Signed-off-by: Matt Lord <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Matt Lord <[email protected]>

* [release-19.0] Bump com.google.protobuf:protobuf-java from 3.24.3 to 3.25.5 in /java (#16809) (#16837)

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* [release-19.0] VTAdmin: Upgrade deps to address security vulns (#16843) (#16846)

Signed-off-by: Matt Lord <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Matt Lord <[email protected]>

* [release-19.0] Support passing filters to `discovery.NewHealthCheck(...)` (#16170) (#16871)

Signed-off-by: Tim Vaillancourt <[email protected]>

* [release-19.0] Fail fast when builtinbackup fails to restore a single file (#16856) (#16867)

Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Upgrade Golang to 1.22.8 (#16895)

Signed-off-by: Florent Poinsard <[email protected]>

* [release-19.0] VTTablet: smartconnpool: notify all expired waiters (#16897) (#16901)

Signed-off-by: Brendan Dougherty <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Fix race in `replicationLagModule` of `go/vt/throttle` (#16078) (#16899)

Signed-off-by: Tim Vaillancourt <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Tim Vaillancourt <[email protected]>

* [release-19.0] Bump commons-io:commons-io from 2.7 to 2.14.0 in /java (#16889) (#16930)

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* [release-19.0] fixes bugs around expression precedence and LIKE (#16934 & #16649) (#16945)

Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Manan Gupta <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>

* [release-19.0] Flaky test fixes (#16940) (#16958)

Signed-off-by: Manan Gupta <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>

* [release-19.0] fix: route engine to handle column truncation for execute after lookup (#16981) (#16984)

Signed-off-by: Harshit Gangal <[email protected]>
Co-authored-by: Harshit Gangal <[email protected]>

* [release-19.0] bugfix: add HAVING columns inside derived tables (#16976) (#16978)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>

* [release-19.0] Fix deadlock between health check and topology watcher (#16995) (#17008)

Signed-off-by: Manan Gupta <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] Add support for `MultiEqual` opcode for lookup vindexes. (#16975) (#17039)

Signed-off-by: Arthur Schreiber <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>

* [release-19.0] bugfix: treat EXPLAIN like SELECT (#17054) (#17056)

Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>

* [release-19.0] Delegate Column Availability Checks to MySQL for Single-Route Queries (#17077) (#17085)

Signed-off-by: Harshit Gangal <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Andres Taylor <[email protected]>
Co-authored-by: Harshit Gangal <[email protected]>

* Bugfix for Panic on Joined Queries with Non-Authoritative Tables in Vitess 19.0 (#17103)

Signed-off-by: Andres Taylor <[email protected]>

* [release-19.0] Improve Schema Engine's TablesWithSize80 query (#17066) (#17089)

Signed-off-by: Shlomi Noach <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Shlomi Noach <[email protected]>

* [release-19.0] Fix unreachable errors when taking a backup (#17062) (#17110)

Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>

* [release-19.0] Code Freeze for `v19.0.7` (#17148)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>

* [release-19.0] Release of `v19.0.7` (#17149)

Signed-off-by: Rohit Nayak <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>

* restore test conditional for v18 vttablet

Signed-off-by: Tim Vaillancourt <[email protected]>

* restore more test conditional for v18 binaries

Signed-off-by: Tim Vaillancourt <[email protected]>

* restore whitespace

Signed-off-by: Tim Vaillancourt <[email protected]>

* Revert "[release-19.0] Improve the queries upgrade/downgrade CI workflow by using same test code version as binary (#16494) (#16501)"

This reverts commit 25a80ac.

* add missing table from cleanup

Signed-off-by: Tim Vaillancourt <[email protected]>

---------

Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: notfelineit <[email protected]>
Signed-off-by: <>
Signed-off-by: bddicken <[email protected]>
Signed-off-by: Harshit Gangal <[email protected]>
Signed-off-by: Vicent Marti <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: GitHub <[email protected]>
Signed-off-by: Arthur Schreiber <[email protected]>
Signed-off-by: Manan Gupta <[email protected]>
Signed-off-by: Frances Thai <[email protected]>
Signed-off-by: Shlomi Noach <[email protected]>
Signed-off-by: Rohit Nayak <[email protected]>
Signed-off-by: Florent Poinsard <[email protected]>
Signed-off-by: Matt Lord <[email protected]>
Signed-off-by: shanth96 <[email protected]>
Signed-off-by: deepthi <[email protected]>
Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: Tim Vaillancourt <[email protected]>
Signed-off-by: Brendan Dougherty <[email protected]>
Co-authored-by: Andrés Taylor <[email protected]>
Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com>
Co-authored-by: Frances Thai <[email protected]>
Co-authored-by: Harshit Gangal <[email protected]>
Co-authored-by: vitess-bot <[email protected]>
Co-authored-by: frouioui <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Arthur Schreiber <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>
Co-authored-by: Florent Poinsard <[email protected]>
Co-authored-by: Matt Lord <[email protected]>
Co-authored-by: Shlomi Noach <[email protected]>
Co-authored-by: Rohit Nayak <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Backport to: release-19.0 Needs to be back ported to release-19.0 Backport to: release-20.0 Needs to be backport to release-20.0 Component: Backup and Restore Type: Bug
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Errors while restoring a file do not fail fast
3 participants