Query properties rule filtering #2613

max-hoffman · 2024-07-31T19:29:50Z

Edit most of the analyzer interfaces to pass a new context object that accumulates query specific properties. Currently the object is called QueryFlags, and accumulates information about the query to inform better rule filtering and more efficient spooling strategies.

The change that has the biggest effect on oltp_point_select perf is the sql.QFlagMax1Row setting, which lets us skip the default results iter boilerplate when we're only returning one row. Added a couple other skips for rules that are easy to whitelist correctly and show prominently on CPU profiles, like aggregations and subqueries.

zachmu

The general idea is sound and good, but this implementation conflates two separate concerns, which is undesirable:

The analyzer wants to skip / alter certain steps based on previously determined facts about the plan
The row spooler wants to shave some time by avoiding the overhead of a loop when it's not necessary

The first concern should be encapsulated in a data structure local to a given analyzer invocation. Leaking it to the query context as a whole is not ideal. It considerably tightens the analyzer / context contract in a way that will be easy to introduce bugs into.

The second concern seems more appropriate for this implementation, but it's still not great. A better solution imo would be something like a QueryProps top-level node that the analyzer uses to communicate things of this nature to the rest of the engine. A new return type from analysis that contains [Node, Props] would also work well.

But I definitely want to push back on cramming more things into the context / session, it's already quite crowded in there. And more importantly, the lifecycle of that object is much longer / broader than the narrow concerns you're trying to address here.

zachmu · 2024-08-01T19:11:20Z

server/handler.go

@@ -499,6 +501,33 @@ func resultForEmptyIter(ctx *sql.Context, iter sql.RowIter, resultFields []*quer
 	return &sqltypes.Result{Fields: resultFields}, nil
 }

+// resultForMax1RowIter ensures that an empty iterator returns at most one row
+func resultForMax1RowIter(ctx *sql.Context, schema sql.Schema, iter sql.RowIter, resultFields []*querypb.Field) (*sqltypes.Result, error) {
+	defer trace2.StartRegion(ctx, "Handler.resultForMax1RowIter").End()


You should use a better import alias for this

zachmu · 2024-08-01T19:13:38Z

sql/analyzer/costed_index_scan.go

+		// Strict index lookup without a join or subquery scope will return
+		// at most one row. We could also use some sort of scope counting
+		// to check for single scope.
+		ctx.QProps.Set(sql.QPropMax1Row)


Kind of concerned about the lifecycle of these properties introducing correctness issues over time.

Do you have a good sense about e.g.

contexts are always created fresh for every new query (even in multiquery mode)

subcontexts don't inherit them?

zachmu · 2024-08-01T19:14:43Z

sql/analyzer/indexed_joins.go

@@ -155,6 +155,8 @@ func replanJoin(ctx *sql.Context, n *plan.JoinNode, a *Analyzer, scope *plan.Sco
 	j := memo.NewJoinOrderBuilder(m)
 	j.ReorderJoin(n)

+	ctx.QProps.Set(sql.QPropInnerJoin)


This just means there's one or more inner joins somewhere in the query?

zachmu · 2024-08-01T19:19:40Z

sql/query_props.go

+
+package sql
+
+const (


Should document these with comments

…o-mysql-server into max/query-props-rule-filtering

max-hoffman · 2024-08-05T17:09:00Z

@zachmu I refactored this to pass flags through rule handlers. Equivalent behavior, but the lifecycle is separate from the context now.

zachmu

LGTM, just one note on this implementation

zachmu · 2024-08-05T18:11:52Z

sql/query_flags.go

+	qp.Flags.Add(flag)
+}
+
+func (qp *QueryFlags) IsSet(flag int) bool {


"Everything is set if it's nil" is kind of insane bro

If you want this kind of default behavior, put it in a function up one abstraction layer (in the analyzer package), let this be a dumb data object.

max-hoffman added 11 commits July 29, 2024 18:52

Add set of query flags that let us skip optimizer rules

93a4bd5

more filtering, max1RowIter

554e93e

simplify use of flags

8b2a61c

cleanup

51f3b9f

tag joins at a safer point

d04ad35

try to repro CI failure

87c4527

Merge branch 'main' into max/query-props-rule-filtering

c5ccb2c

better comment

23da4d7

Merge branch 'main' into max/query-props-rule-filtering

3ff2ade

cleanup

a6fd01e

add not skipping back, that one should be safe

a584c06

max-hoffman requested a review from zachmu August 1, 2024 17:56

zachmu reviewed Aug 1, 2024

View reviewed changes

max-hoffman and others added 7 commits August 2, 2024 14:57

progress, triggers still breaking

ab68052

[ga-format-pr] Run ./format_repo.sh to fix formatting

12c2cf4

fix test

82f7925

Merge branch 'max/query-props-rule-filtering' of github.com:dolthub/g…

d1d87e3

…o-mysql-server into max/query-props-rule-filtering

Merge branch 'main' into max/query-props-rule-filtering

ee1ad90

change name to QueryFlags

c8199eb

doc comments

18a670e

zachmu approved these changes Aug 5, 2024

View reviewed changes

zach's comments

3b44746

max-hoffman merged commit f695d9f into main Aug 5, 2024
7 of 8 checks passed

max-hoffman deleted the max/query-props-rule-filtering branch August 5, 2024 19:20

This was referenced Aug 6, 2024

dolt 1.42.8 Homebrew/homebrew-core#180180

Merged

dolt 1.42.9 Homebrew/homebrew-core#180454

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query properties rule filtering #2613

Query properties rule filtering #2613

max-hoffman commented Jul 31, 2024 •

edited

Loading

zachmu left a comment

zachmu Aug 1, 2024

zachmu Aug 1, 2024

zachmu Aug 1, 2024

zachmu Aug 1, 2024

max-hoffman commented Aug 5, 2024

zachmu left a comment

zachmu Aug 5, 2024


		package sql

		const (

Query properties rule filtering #2613

Query properties rule filtering #2613

Conversation

max-hoffman commented Jul 31, 2024 • edited Loading

zachmu left a comment

Choose a reason for hiding this comment

zachmu Aug 1, 2024

Choose a reason for hiding this comment

zachmu Aug 1, 2024

Choose a reason for hiding this comment

zachmu Aug 1, 2024

Choose a reason for hiding this comment

zachmu Aug 1, 2024

Choose a reason for hiding this comment

max-hoffman commented Aug 5, 2024

zachmu left a comment

Choose a reason for hiding this comment

zachmu Aug 5, 2024

Choose a reason for hiding this comment

max-hoffman commented Jul 31, 2024 •

edited

Loading