improvements to accuracy/performance for float^integer #24500

stevengj · 2017-11-06T22:13:34Z

This PR makes a few improvements to accuracy and/or performance for float^integer exponentiation.

Float64^integer and Float32^integer no longer check for DomainErrors, since these can only occur for fractional powers (make x^-n equivalent to inv(x)^n for literal n #24240 (comment)). This seems to greatly improve performance in some cases (by a factor of 20+) on my machine, and the performance advantage of inv(x)^n over x^-n seems to be mostly gone.
float^-n no longer uses the inv(x)^n fallback for literal n, since that degrades accuracy slightly (make x^-n equivalent to inv(x)^n for literal n #24240 (comment))

stevengj · 2017-11-07T16:43:45Z

CI failures seem unrelated.

stevengj · 2017-11-13T20:00:06Z

StefanKarpinski · 2017-11-13T20:32:41Z

Looks good to me but I'm not sure I'm the best person to review this. Maybe @vtjnash for the LLVM bit and @simonbyrne for the math bit?

simonbyrne · 2017-11-13T23:32:36Z

base/intfuncs.jl

+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{0}) = one(x)
+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{1}) = x
+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{2}) = x*x
+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{3}) = x*x*x


Is it worth adding @inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{-1}) = 1/x as well?

Why are these ambiguous? LLVM already knows about how to do this rewrite, when it can show that it won't cause accuracy issues. That at least seemed to be the conclusion in #19890 (otherwise, I would have also just altered the flag to let LLVM know that it was allowed to do this rewrite).

Without these methods, literal_pow(::typeof(^), x::AbstractFloat, ::Val{p}) is ambiguous with literal_pow(::typeof(^), x::HWNumber, ::Val{2}) etc.

@simonbyrne, that couldn't hurt, I guess — might as well add it.

@vtjnash, actually, it seems like there is no longer an ambiguity — there used to be an ambiguity when I was defining literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{p}), but it looks like it may be fine now that it is AbstractFloat.

Hm, ok. It still seems unnecessary to be defining a special function overlay to be doing something that LLVM is doing already (e.g., it already rewrites pow(x, 2) as x*x whenever possible). The same is true of -1 method proposed above: that optimization is already implemented transparently in the backend.

LLVM only does that conversion for types that it knows about. e.g. it won't do it for Complex{Float64} or DecimalFloat32.

vtjnash · 2017-11-14T02:30:49Z

base/intfuncs.jl

@@ -234,6 +234,15 @@ const HWNumber = Union{HWReal, Complex{<:HWReal}, Rational{<:HWReal}}
 @inline literal_pow(::typeof(^), x::HWNumber, ::Val{2}) = x*x
 @inline literal_pow(::typeof(^), x::HWNumber, ::Val{3}) = x*x*x

+# don't use the inv(x) transformation here since x^p is slightly more accurate


It seems like you should have to opt-in to getting a less accurate answer that violates referential transparency, rather than opt-out of it.

For most types, it's not less accurate. For most floating-point-based types (complex numbers, matrices, etc.), negative integer powers are already computed by inv(x)^-n. And for many other types (e.g. Int), negative literal powers wouldn't be defined at all without this transformation.

Can we not re-litigate the whole literal_pow debate in this PR? The question is whether this PR is an improvement on the current literal_pow rules.

vtjnash · 2017-11-14T02:36:04Z

base/intfuncs.jl

+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{0}) = one(x)
+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{1}) = x
+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{2}) = x*x
+@inline literal_pow(::typeof(^), x::Union{Float32,Float64}, ::Val{3}) = x*x*x


Why are these ambiguous? LLVM already knows about how to do this rewrite, when it can show that it won't cause accuracy issues. That at least seemed to be the conclusion in #19890 (otherwise, I would have also just altered the flag to let LLVM know that it was allowed to do this rewrite).

stevengj · 2017-11-20T17:52:55Z

Bump.

stevengj · 2017-12-05T15:10:01Z

Any chance of merging this? It seems like a strict improvement over the current situation, and CI passes except for the usual AppVeyor hiccups.

iblislin · 2017-12-05T17:48:39Z

master build failed after this PR merged:

Error in testset numbers:
Test Failed at /home/julia/ci/worker/11rel-amd64/build/test/numbers.jl:2971
  Expression: 0.09496527f0 ^ -2 === 110.88438f0
   Evaluated: 110.884384f0 === 110.88438f0

seems only happened on amd64 build

Ref:

fredrikekre · 2017-12-05T19:25:10Z

Should have rerun CI here since it was over 2 weeks since the last time.

StefanKarpinski · 2017-12-05T19:39:59Z

Sorry, my bad.

KristofferC · 2017-12-05T20:02:51Z

So what do we do? Quick revert or live with it until someone fixes it?

StefanKarpinski · 2017-12-05T20:56:35Z

@stevengj – do you think you could fix this soon or should I revert this?

This reverts commit 8fa23ed.

StefanKarpinski · 2017-12-05T21:24:06Z

Here's the revert PR: #24930

simonbyrne · 2017-12-05T22:53:06Z

It appears to be due to the use of LLVM intrinsics. I think we've had trouble with those before (e.g. #2741 & #8939).

Revert "improvements to accuracy/performance for float^integer (#24500)"

stevengj · 2017-12-06T04:22:00Z

I don't really understand why there would be a problem with the LLVM intrinsics here. The ccall is virtually identical to the one in ^(x::Float64, y::Float64), just with the subsequent isnan check removed.

stevengj · 2017-12-06T04:24:27Z

I'm guessing that the code is perfectly fine, but that 0.09496527f0 ^ -2 === 110.88438f0 test is too stringent? It looks like on amd64 it is actually getting a more accurate result, which should be fine…

(Is it something about extended-precision mode on Linux?)

…Lang#24500)" This reverts commit 8fa23ed.

improvements to accuracy/performance for float^integer

14a4677

stevengj added maths Mathematical functions performance Must go faster labels Nov 6, 2017

stevengj requested a review from vtjnash November 6, 2017 22:13

eliminate inv(x)^n fallback for all AbstractFloat types

0db74f2

stevengj requested a review from StefanKarpinski November 7, 2017 23:33

StefanKarpinski approved these changes Nov 13, 2017

View reviewed changes

simonbyrne reviewed Nov 13, 2017

View reviewed changes

vtjnash reviewed Nov 14, 2017

View reviewed changes

method ambiguity code no longer needed, add back float^-1 = inv(x)

a9354df

StefanKarpinski merged commit 8fa23ed into JuliaLang:master Dec 5, 2017

stevengj deleted the betterfloatpows branch December 5, 2017 15:57

iblislin mentioned this pull request Dec 5, 2017

Doctests for some IO functions #24922

Merged

StefanKarpinski added a commit that referenced this pull request Dec 5, 2017

Revert "improvements to accuracy/performance for float^integer (#24500)"

482aafe

This reverts commit 8fa23ed.

StefanKarpinski added a commit that referenced this pull request Dec 5, 2017

Merge pull request #24930 from JuliaLang/sk/revert-24500

11d5a53

Revert "improvements to accuracy/performance for float^integer (#24500)"

stevengj restored the betterfloatpows branch December 6, 2017 04:18

stevengj deleted the betterfloatpows branch December 6, 2017 04:28

stevengj mentioned this pull request Dec 6, 2017

updated improvements to accuracy/performance for float^integer #24937

Merged

fredrikekre mentioned this pull request Dec 7, 2017

Examples and doctests for show #24931

Merged

evetion pushed a commit to evetion/julia that referenced this pull request Dec 12, 2017

improvements to accuracy/performance for float^integer (JuliaLang#24500)

6018887

evetion pushed a commit to evetion/julia that referenced this pull request Dec 12, 2017

Revert "improvements to accuracy/performance for float^integer (Julia…

25ce626

…Lang#24500)" This reverts commit 8fa23ed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improvements to accuracy/performance for float^integer #24500

improvements to accuracy/performance for float^integer #24500

stevengj commented Nov 6, 2017

stevengj commented Nov 7, 2017

stevengj commented Nov 13, 2017

StefanKarpinski commented Nov 13, 2017

simonbyrne Nov 13, 2017 •

edited

Loading

vtjnash Nov 14, 2017

stevengj Nov 14, 2017

stevengj Nov 14, 2017

stevengj Nov 14, 2017

vtjnash Nov 14, 2017

stevengj Nov 14, 2017

vtjnash Nov 14, 2017

stevengj Nov 14, 2017

vtjnash Nov 14, 2017

stevengj commented Nov 20, 2017

stevengj commented Dec 5, 2017

iblislin commented Dec 5, 2017 •

edited

Loading

fredrikekre commented Dec 5, 2017

StefanKarpinski commented Dec 5, 2017

KristofferC commented Dec 5, 2017

StefanKarpinski commented Dec 5, 2017

StefanKarpinski commented Dec 5, 2017

simonbyrne commented Dec 5, 2017 •

edited

Loading

stevengj commented Dec 6, 2017

stevengj commented Dec 6, 2017 •

edited

Loading

improvements to accuracy/performance for float^integer #24500

improvements to accuracy/performance for float^integer #24500

Conversation

stevengj commented Nov 6, 2017

stevengj commented Nov 7, 2017

stevengj commented Nov 13, 2017

StefanKarpinski commented Nov 13, 2017

simonbyrne Nov 13, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stevengj commented Nov 20, 2017

stevengj commented Dec 5, 2017

iblislin commented Dec 5, 2017 • edited Loading

fredrikekre commented Dec 5, 2017

StefanKarpinski commented Dec 5, 2017

KristofferC commented Dec 5, 2017

StefanKarpinski commented Dec 5, 2017

StefanKarpinski commented Dec 5, 2017

simonbyrne commented Dec 5, 2017 • edited Loading

stevengj commented Dec 6, 2017

stevengj commented Dec 6, 2017 • edited Loading

simonbyrne Nov 13, 2017 •

edited

Loading

iblislin commented Dec 5, 2017 •

edited

Loading

simonbyrne commented Dec 5, 2017 •

edited

Loading

stevengj commented Dec 6, 2017 •

edited

Loading