`ExecuteFetch`: error on multiple result sets #14949

shlomi-noach · 2024-01-15T09:04:52Z

Description

ExecuteFetch now returns an error when faced with multiple result sets. It should only be used when a single result set is expected. See #14948

The main change in this PR is in go/mysql/query.go. The rest of the PR adapts many tests to the new logic, and fixes the logic of some tests that were silently missing error responses because of the previous ExecuteFetch behavior.

This PR should not be backported.

Related Issue(s)

Fixes #14948

Checklist

"Backport to:" labels have been added if this change should be back-ported to release branches
If this change is to be back-ported to previous releases, a justification is included in the PR description
Tests were added or are not required
Did the new or modified tests pass consistently locally and on CI?
Documentation was added or is not required

Deployment Notes

Signed-off-by: Shlomi Noach <[email protected]>

vitess-bot · 2024-01-15T09:04:56Z

shlomi-noach · 2024-01-15T09:05:30Z

go/mysql/query.go

+	result, more, err = c.ExecuteFetchMulti(query, maxrows, wantfields)
+	if more {
+		return nil, vterrors.Errorf(vtrpc.Code_INTERNAL, "unexpected multiple results. Use ExecuteFetchMulti instead")
+	}


Question: should we iterate and consume all results?

I think we must here, otherwise we leave the connection in an invalid state. And a subsequent query on the same connection would see the previous result here.

@shlomi-noach are there other places in the code base where we pass in 0 incorrectly and do want to consume all the results?

I'm not sure. ExecuteFetchMulti potentially? But then, this bugs me, because we should be able to pass maxrows = 17 in any place, so why would the draining in ExecuteFetch necessarily have to use -1? And yet, it does, as per #14949 (comment). I'm not sure if this is again limited to stored procedure behavior. I don't think it is.

There's no other explicit c.ReadQueryResult(0, ...) call in the code, FWIW.

@harshit-gangal further edited the ExecuteFetch/drain logic to fix potential leaks, and consolidated the draining logic. I think we should be good now.

shlomi-noach · 2024-01-15T09:49:38Z

As mostly expected, we get a bunch of CI errors. We should start tackling them one by one.

shlomi-noach · 2024-01-24T05:58:19Z

I think it is the procedure above that is corrupting the connection.

Let me look into that. It's not supposed to happen now that we drain the connection. If anything, it shows that what we're trying to fix here is a nasty bug.

shlomi-noach · 2024-01-24T06:47:56Z

I see the problem, and I have a simple patch to solve it, which is a bit of a cheat, and I'd like to understand this better.

As @harshit-gangal suggested, the reason the test was failing was that the connection had result sets from the previous (multi result-set) test. Which surprised me, because the very essence of this PR is to drain all result sets in ExecuteFetch, so how could there be any leftover result sets?

I found that the way QueryExecutor drains results is different from how ExecuteFetch drains results... And I'm not sure why the QueryExecutor technique works where ExecuteFetch does not. Is it related specifically to how stored procedures work?

The QueryExecutor way:

func (qre *QueryExecutor) drainResultSetOnConn(conn *connpool.Conn) error {
	more := true
	for more {
		qr, err := conn.FetchNext(qre.ctx, int(qre.getSelectLimit()), true)
		if err != nil {
			return err
		}
		more = qr.IsMoreResultsExists()
	}
	return nil
}

The ExecuteFetch way:

func (c *Conn) ExecuteFetch(query string, maxrows int, wantfields bool) (result *sqltypes.Result, err error) {
	var more bool
	result, more, err = c.ExecuteFetchMulti(query, maxrows, wantfields)
	if more {
		// Multiple results are unexpected. Prioritize this "unexpected" error over whatever error we got from the first result.
		err = errors.Join(ErrExecuteFetchMultipleResults, err)
	}
	// Even though we do not allow multiple result sets, we still prefer to drain them so as to clean the connection, as well as
	// exhaust any further possible error.
	for more {
		var moreErr error
		_, more, _, moreErr = c.ReadQueryResult(0, false)
		if err != nil {
			err = errors.Join(err, moreErr)
		}
	}
	return result, err
}

My solution, BTW, which is a bit of a cheat, is to close the connection when multiple result sets are found, like so:

	qr, err := qre.execDBConn(conn.Conn, sql, true)
	if errors.UnwrappedIs(err, mysql.ErrExecuteFetchMultipleResults) {
		conn.Close()
		return nil, vterrors.New(vtrpcpb.Code_UNIMPLEMENTED, "Multi-Resultset not supported in stored procedure")
	}

It works, but I suspect there's a better way to make this work.

shlomi-noach · 2024-01-24T07:05:11Z

Found it! The difference was that ExecuteFetch was running:

_, more, _, moreErr = c.ReadQueryResult(0, false)

Using 0 maxrows. Which means some result rows were left in the pipe. When changing to -1 (unlimited) the problem goes away.
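To make the difference between draining with 0 and with -1 concrete, here is a minimal toy model. It assumes, as described above, that a reader with a non-negative row limit stops early and leaves the unread rows on the connection. `toyConn`, `readResult`, `drain`, and `leftover` are hypothetical names; the real packet handling in go/mysql is more involved and is not reproduced here.

```go
package main

import "fmt"

// toyConn is a hypothetical stand-in for a connection: pending holds the
// result sets the server has already sent, each one a slice of rows.
type toyConn struct {
	pending [][]string
}

// readResult consumes rows from the next pending result set. A negative
// maxrows means "no limit". With a non-negative limit smaller than the result
// set, the reader gives up early and the unread rows stay in the pipe.
func (c *toyConn) readResult(maxrows int) (rows []string, more bool) {
	if len(c.pending) == 0 {
		return nil, false
	}
	set := c.pending[0]
	if maxrows >= 0 && maxrows < len(set) {
		c.pending[0] = set[maxrows:] // leftover rows remain on the connection
		return set[:maxrows], false
	}
	c.pending = c.pending[1:]
	return set, len(c.pending) > 0
}

// leftover counts the rows still sitting unread on the connection.
func (c *toyConn) leftover() int {
	n := 0
	for _, set := range c.pending {
		n += len(set)
	}
	return n
}

// drain reads result sets until the reader reports that no more follow.
func drain(c *toyConn, maxrows int) {
	more := len(c.pending) > 0
	for more {
		_, more = c.readResult(maxrows)
	}
}

func main() {
	for _, maxrows := range []int{0, -1} {
		c := &toyConn{pending: [][]string{{"r1", "r2"}, {"r3"}}}
		drain(c, maxrows)
		fmt.Printf("maxrows=%d leftover rows=%d\n", maxrows, c.leftover())
	}
}
```

In this model the drain with `maxrows=0` leaves all three rows behind, while `maxrows=-1` leaves none, which matches the symptom reported in the failing test.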

shlomi-noach · 2024-01-24T10:06:07Z

Good to review!

dbussink · 2024-01-24T11:11:29Z

go/mysql/query.go

+// caring for any results. The function returns an error if any of the statements fail.
+// The function drains the query results of all statements, even if there's an error.
+func (c *Conn) ExecuteFetchMultiDrain(query string) (err error) {
+	_, more, err := c.ExecuteFetchMulti(query, 0, false)


@shlomi-noach is the 0 here correct or could that result in a similar issue?

It's correct, because the first query runs through ReadQueryResult which drains the results. Then, ExecuteFetchMultiDrain actively drains the results of the remaining queries. What's important here is that we don't do double-draining on the same result, which is what happened previously and was fixed both by my change and then by @harshit-gangal's change on top.

We can set -1 for visual consistency with other calls, too. That would be fine.

Isn’t 0 then much more efficient since we can avoid loading a whole bunch of data and allocating objects for it which we then drop here?

And does that imply we should be using 0 more in the case where we only care about if we get an error or not and not the actual result?

Dunno if we have many more of those cases in the code base though.

I'm trying to make this work but keep getting the double-drain problem. @harshit-gangal is there a way for us to use 0 so that we don't read excessive rows into memory, and still drain correctly?

Hmmm actually I still have errors.

OK, this passes tests:

diff --git a/go/mysql/query.go b/go/mysql/query.go
index 18e9872752..d0f68ec35c 100644
--- a/go/mysql/query.go
+++ b/go/mysql/query.go
@@ -39,6 +39,13 @@ var (
 	ErrExecuteFetchMultipleResults = vterrors.Errorf(vtrpc.Code_INTERNAL, "unexpected multiple results. Use ExecuteFetchMulti instead.")
 )
 
+const (
+	// Use as `maxrows` in `ExecuteFetch` and related functions, to indicate no rows should be fetched.
+	// This is different than specifying `0`, because `0` means "expect zero results", while this means
+	// "do not attempt to read any results into memory".
+	FETCH_NO_ROWS = math.MinInt
+)
+
 //
 // Client side methods.
 //
@@ -322,7 +329,7 @@ func (c *Conn) ExecuteFetch(query string, maxrows int, wantfields bool) (result
 // caring for any results. The function returns an error if any of the statements fail.
 // The function drains the query results of all statements, even if there's an error.
 func (c *Conn) ExecuteFetchMultiDrain(query string) (err error) {
-	_, more, err := c.ExecuteFetchMulti(query, 0, false)
+	_, more, err := c.ExecuteFetchMulti(query, FETCH_NO_ROWS, false)
 	return c.drainMoreResults(more, err)
 }
 
@@ -331,7 +338,7 @@ func (c *Conn) ExecuteFetchMultiDrain(query string) (err error) {
 func (c *Conn) drainMoreResults(more bool, err error) error {
 	for more {
 		var moreErr error
-		_, more, _, moreErr = c.ReadQueryResult(-1, false)
+		_, more, _, moreErr = c.ReadQueryResult(FETCH_NO_ROWS, false)
 		err = errors.Join(err, moreErr)
 	}
 	return err
@@ -451,6 +458,9 @@ func (c *Conn) ReadQueryResult(maxrows int, wantfields bool) (*sqltypes.Result,
 	if err != nil {
 		return nil, false, 0, sqlerror.NewSQLError(sqlerror.CRServerLost, sqlerror.SSUnknownSQLState, "%v", err)
 	}
+	if maxrows == FETCH_NO_ROWS {
+		return result, more, warnings, nil
+	}
 	if c.isEOFPacket(data) {
 		defer c.recycleReadPacket()

Committed as a30c804, let's see how the entire CI reacts.

CI looks happy. @harshit-gangal what do you think?

It looks good to me

shlomi-noach · 2024-01-31T17:34:08Z

This PR is looking for two approvals!

shlomi-noach · 2024-01-31T17:41:05Z

Actually, I'm adding a couple more unit tests here. Something I noticed.

shlomi-noach · 2024-01-31T18:10:22Z

OK, I further modified the FETCH_NO_ROWS behavior: https://github.com/vitessio/vitess/pull/14949/files/a30c8045ff79dcd5fede6a27a57dc7ac48327411..db5de4f30e2afc701dd5bc1d9d0097e427fdd4a1

I added a couple unit tests that showed that ExecuteFetchMultiDrain left the connection in invalid state, and followup queries were failing. This commit now plays it very safe: it does iterate the packet for resulting rows, it just does nothing with them. It is more wasteful in that it iterates all results.
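The "iterate the packet rows but do nothing with them" behavior can be sketched roughly as follows. This is a toy stream, not the real MySQL wire protocol: `packetStream`, `readResult`, the literal "EOF" marker, and the lowercase `fetchNoRows` constant are all assumptions made for illustration, and any other `maxrows` value is simply treated as unlimited here.

```go
package main

import (
	"fmt"
	"math"
)

// fetchNoRows is a hypothetical sentinel modeled on the PR's FETCH_NO_ROWS.
const fetchNoRows = math.MinInt

// packetStream is a toy wire: each string is a row packet, and "EOF" ends a
// result set.
type packetStream struct {
	packets []string
	pos     int
}

// readResult walks the stream up to and including the next EOF packet. With
// maxrows == fetchNoRows every row packet is still read off the wire, but
// nothing is kept in memory. Other maxrows values are not enforced in this
// sketch.
func (s *packetStream) readResult(maxrows int) (rows []string) {
	for s.pos < len(s.packets) {
		p := s.packets[s.pos]
		s.pos++
		if p == "EOF" {
			break
		}
		if maxrows == fetchNoRows {
			continue // consume the packet, keep nothing
		}
		rows = append(rows, p)
	}
	return rows
}

func main() {
	s := &packetStream{packets: []string{"r1", "r2", "EOF", "r3", "EOF"}}
	discarded := s.readResult(fetchNoRows)
	next := s.readResult(-1)
	fmt.Println(len(discarded), next)
}
```

The trade-off is the one noted above: the discarded result set costs a full pass over its packets, but the stream position ends up exactly at the start of the next result set, so a follow-up read sees only its own rows.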

shlomi-noach · 2024-02-06T07:28:57Z

Looking for another review.

vmg

Looks well thought out to me.

vmg · 2024-02-14T09:05:15Z

@shlomi-noach I pushed a small commit to revert your UnwrappedIs helper. If you look at the implementation of errors.Is in the standard library, you'll notice you've re-implemented the same function!

https://cs.opensource.google/go/go/+/refs/tags/go1.22.0:src/errors/wrap.go;l=68-73

Also notably, if you change your unit test to use errors.Is, it's also still green. So I removed that unit test too. :)

shlomi-noach · 2024-02-14T09:33:05Z

go/errors/errors_test.go

-			name = tcase.err.Error()
-		}
-		t.Run(name, func(t *testing.T) {
-			is := UnwrappedIs(tcase.err, tcase.target)


@vmg if I replace is := UnwrappedIs with is := errors.Is then there actually is a test failure, but it's for testing whether nil error is nil, which returns true for errors.Is and false for UnwrappedIs. I'm perfectly happy with returning true so the change is good.

Right, I did find that out. It seems like errors.Is(nil, nil) == true is very sensible behavior.

shlomi-noach added 2 commits January 15, 2024 10:58

ExecuteFetch: error on multiple results

222a174

Signed-off-by: Shlomi Noach <[email protected]>

more suggestive description

0125804

Signed-off-by: Shlomi Noach <[email protected]>

shlomi-noach added the Type: Bug, Component: Query Serving, and Component: General labels Jan 15, 2024

shlomi-noach requested review from harshit-gangal, systay and mattlord as code owners January 15, 2024 09:04

shlomi-noach requested a review from a team January 15, 2024 09:04

shlomi-noach commented Jan 15, 2024

github-actions bot added this to the v19.0.0 milestone Jan 15, 2024

exhaust further result sets. Prioritize multi-results error over result set errors

48bc86d

Signed-off-by: Shlomi Noach <[email protected]>

shlomi-noach removed the NeedsDescriptionUpdate, NeedsWebsiteDocsUpdate, and NeedsIssue labels Jan 15, 2024

shlomi-noach mentioned this pull request Jan 15, 2024

Protect ExecuteFetchAsDBA against multi-statements, excluding a sequence of CREATE TABLE|VIEW. #14954

Merged

5 tasks

break down ExecuteFetch multi-statements

232da20

Signed-off-by: Shlomi Noach <[email protected]>

shlomi-noach requested review from deepthi and GuptaManan100 as code owners January 15, 2024 16:53

shlomi-noach added 5 commits January 15, 2024 19:03

ExecuteFetchMultiDrain

a6d97f5

Signed-off-by: Shlomi Noach <[email protected]>

remove uneccessary (though unharmful) semicolon

4e3544f

Signed-off-by: Shlomi Noach <[email protected]>

fix multi statements

a4d0271

Signed-off-by: Shlomi Noach <[email protected]>

fix multi statements

7b28d8f

Signed-off-by: Shlomi Noach <[email protected]>

fix multi statements

2cde946

Signed-off-by: Shlomi Noach <[email protected]>

shlomi-noach and others added 6 commits January 24, 2024 09:13

test both before and after multi-result procs

dcffe94

Signed-off-by: Shlomi Noach <[email protected]>

do not limit rows, so that we can consume them all

404ddb6

Signed-off-by: Shlomi Noach <[email protected]>

remove grant relaxation patch

fbb23a6

Signed-off-by: Shlomi Noach <[email protected]>

resolved conflict

4351f12

Signed-off-by: Shlomi Noach <[email protected]>

refactor: move drain to it's own little method

8e1eb7f

Signed-off-by: Harshit Gangal <[email protected]>

fix test: drop table (previously there was a hidden 'Table 'test_idx' already exists (errno 1050) (sqlstate 42S01)' error

50c2243

Signed-off-by: Shlomi Noach <[email protected]>

dbussink reviewed Jan 24, 2024

shlomi-noach removed the NeedsBackportReason label Jan 28, 2024

shlomi-noach requested a review from a team January 29, 2024 05:35

FETCH_NO_ROWS

a30c804

Signed-off-by: Shlomi Noach <[email protected]>

shlomi-noach added 2 commits January 31, 2024 20:04

multi-drain still fully reads packet rows, just not into memory

39ab378

Signed-off-by: Shlomi Noach <[email protected]>

code comments

db5de4f

Signed-off-by: Shlomi Noach <[email protected]>

harshit-gangal approved these changes Feb 1, 2024

frouioui modified the milestones: v19.0.0, v20.0.0 Feb 6, 2024

vmg approved these changes Feb 14, 2024

errors: do not re-implement errors.Is

9620d3c

Signed-off-by: Vicent Marti <[email protected]>

shlomi-noach commented Feb 14, 2024

vmg merged commit 8960bc3 into vitessio:main Feb 14, 2024
101 of 102 checks passed

vmg deleted the execute-fetch-error-more branch February 14, 2024 09:45

shlomi-noach mentioned this pull request Mar 18, 2024

Tracking: introduce ExecuteMultiFetchAsDba gRPC method #15505

Open
