evalengine: Internal cleanup and consistency fixes #14854

dbussink · 2023-12-22T19:23:50Z

While working on #14841 and running some tests, I ran into some other issues / consistency problems in the evalengine that we can clean up and fix.

First we use time.Duration for intervals, which already provides constants and we don't have to deal with nanoseconds then separately but they are part of durations.

We clean up the tinyweight function which does a bunch of casting which works but isn't as clear and would actually break on 32 bit (but we don't support that anyway). It now also returns 0, 1 or -1 which is more how other Go Cmp functions work.

We remove int from dataOutOfRangeError since the evalengine only works with int64 or uint64 anyway, so any usage of int would really be a bug (and we didn't deal with uint either so it was inconsistent anyway).

The bit shift operations also need to operate on int64 explicitly, since that's what the inputs are in the evalengine. So we should keep the types consistent.

Next, we were missing a now possible optimization which is that we have size for temporal times at compile time. This means we know if we need to convert to integer or decimal. We don't hit the deoptimize path anymore, and now also error hard if that happens since compilation is broken in that case.

Lastly we were not dealing with underflow / overflow checks correctly in FROM_UNIXTIME between the evaluator and compiler. We need to check before conversions, because specifically float64 to int64 conversions have badly defined behavior for large float64 values. It behaves differently on amd64 vs arm64 vs i386 for example already. Some convert large values to negative ints, others positive or even other values. By checking before casting we avoid this and can behave consistently.

Related Issue(s)

Found when working on #14841

Checklist

"Backport to:" labels have been added if this change should be back-ported to release branches
If this change is to be back-ported to previous releases, a justification is included in the PR description
Tests were added or are not required
Did the new or modified tests pass consistently locally and on CI?
Documentation was added or is not required

While working on vitessio#14841 and running some tests, I ran into some other issues / consistency problems in the evalengine that we can clean up and fix. First we use `time.Duration` for intervals, which already provides constants and we don't have to deal with nanoseconds then separately but they are part of durations. We clean up the tinyweight function which does a bunch of casting which works but isn't as clear and would actually break on 32 bit (but we don't support that anyway). It now also returns 0, 1 or -1 which is more how other Go `Cmp` functions work. We remove `int` from `dataOutOfRangeError` since the `evalengine` only works with `int64` or `uint64` anyway, so any usage of `int` would really be a bug (and we didn't deal with `uint` either so it was inconsistent anyway). The bit shift operations also need to operate on int64 explicitly, since that's what the inputs are in the `evalengine`. So we should keep the types consistent. Next, we were missing a now possible optimization which is that we have size for temporal times at compile time. This means we know if we need to convert to integer or decimal. We don't hit the deoptimize path anymore, and now also error hard if that happens since compilation is broken in that case. Lastly we were not dealing with underflow / overflow checks correctly in `FROM_UNIXTIME` between the evaluator and compiler. We need to check before conversions, because specifically float64 to int64 conversions have badly defined behavior for large float64 values. It behaves differently on amd64 vs arm64 vs i386 for example already. Some convert large values to negative ints, others positive or even other values. By checking before casting we avoid this and can behave consistently. Signed-off-by: Dirkjan Bussink <[email protected]>

vitess-bot · 2023-12-22T19:23:53Z

dbussink · 2023-12-22T19:25:52Z

go/vt/vtgate/evalengine/fn_time.go

@@ -559,52 +559,81 @@ func (b *builtinFromUnixtime) eval(env *ExpressionEnv) (eval, error) {

 	switch ts := ts.(type) {
 	case *evalInt64:
+		if ts.i < 0 || ts.i >= maxUnixtime {
+			return nil, nil
+		}


It's repetitive, but we have to check up front here. Some fun things:

amd64

Large float turns into negative int64.

int64(1.2345678912345678e300) -9223372036854775808

arm64

Large float64 turns into max int64.

int64(1.2345678912345678e+300) 9223372036854775807

This means we can't convert before checking overflow / underflow since it won't be consistent.

dbussink · 2023-12-22T19:26:36Z

go/vt/vtgate/evalengine/expr_bit.go

-		length = len(num)
+		bits   = int64(shift % 8)
+		bytes  = int64(shift / 8)
+		length = int64(len(num))


bit shifting operates on int64 so we should use that consistently.

Signed-off-by: Dirkjan Bussink <[email protected]>

dbussink · 2023-12-22T20:53:30Z

go/mysql/datetime/datetime.go

-	return (dt.Date.Day()-1)*secondsPerDay + dt.Time.toSeconds()
+func (dt DateTime) toDuration() time.Duration {
+	dur := dt.Time.toDuration()
+	if !dt.Date.IsZero() {


When we have a zero date (and thus we're a Time type), we don't include the day bit to have a correct duration.

dbussink · 2023-12-22T20:53:54Z

go/mysql/datetime/datetime.go

+		dur := dt.toDuration()
+		dur += itv.toDuration()
+		days := time.Duration(0)
+		if !dt.Date.IsZero() {


Rounding shouldn't move days if we're a Time instance. We then can have more than 24 hours.

Signed-off-by: Dirkjan Bussink <[email protected]>

dbussink requested review from harshit-gangal, systay, frouioui, GuptaManan100, shlomi-noach, vmg and mattlord as code owners December 22, 2023 19:23

github-actions bot added this to the v19.0.0 milestone Dec 22, 2023

dbussink commented Dec 22, 2023

View reviewed changes

Use correct int value

1b2ed6f

Signed-off-by: Dirkjan Bussink <[email protected]>

dbussink requested a review from rohit-nayak-ps as a code owner December 22, 2023 20:12

Handle time only values for rounding

bf08036

Signed-off-by: Dirkjan Bussink <[email protected]>

dbussink commented Dec 22, 2023

View reviewed changes

Fix additional size typing

bc268a7

Signed-off-by: Dirkjan Bussink <[email protected]>

dbussink requested a review from deepthi as a code owner December 23, 2023 07:25

mattlord approved these changes Dec 26, 2023

View reviewed changes

Merge branch 'main' into dbussink/evalengine-cleanup

f908537

Signed-off-by: Dirkjan Bussink <[email protected]>

GuptaManan100 approved these changes Dec 27, 2023

View reviewed changes

dbussink merged commit d62a5c5 into vitessio:main Dec 27, 2023
104 checks passed

dbussink deleted the dbussink/evalengine-cleanup branch December 27, 2023 07:37

dbussink mentioned this pull request Dec 29, 2023

Use one canonical style for unlimited queries #14870

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

evalengine: Internal cleanup and consistency fixes #14854

evalengine: Internal cleanup and consistency fixes #14854

dbussink commented Dec 22, 2023

vitess-bot bot commented Dec 22, 2023

dbussink Dec 22, 2023

dbussink Dec 22, 2023

dbussink Dec 22, 2023

dbussink Dec 22, 2023

evalengine: Internal cleanup and consistency fixes #14854

evalengine: Internal cleanup and consistency fixes #14854

Conversation

dbussink commented Dec 22, 2023

Related Issue(s)

Checklist

vitess-bot bot commented Dec 22, 2023

Review Checklist

General

Tests

Documentation

New flags

If a workflow is added or modified:

Backward compatibility

dbussink Dec 22, 2023

Choose a reason for hiding this comment

amd64

arm64

dbussink Dec 22, 2023

Choose a reason for hiding this comment

dbussink Dec 22, 2023

Choose a reason for hiding this comment

dbussink Dec 22, 2023

Choose a reason for hiding this comment