simd: implement missing intrinsics from simd/generic-arithmetic-pass.rs #382

sadlerap · 2023-11-10T03:56:15Z

This implements/fixes the following intrinsics:

simd_bswap
simd_frem
simd_bitreverse
simd_ctlz
simd_cttz
simd_neg

These were the remaining intrinsics from the simd/generic-arithmetic-pass.rs ui test, which should now pass with a patched libgccjit. However, the test does not seem to compile against a non-patched libgccjit, so it is left disabled there for now.

antoyo · 2023-11-16T15:09:48Z

I'll do the review soon, hopefully in the next few days.
I've been quite busy lately.

sadlerap · 2023-11-16T15:15:29Z

No worries, take all the time you need.

antoyo

A few nitpicks.

antoyo · 2023-11-23T12:44:33Z

src/intrinsic/simd.rs

+        let shuffled = hi | lo;
+        let cast_ty = bx.context.new_vector_type(elem_type, byte_vector_type_size / (elem_size_bytes as u64));
+        let loaded = bx.context.new_bitcast(None, shuffled, cast_ty);
+        let elems: Vec<_> = (0..in_len)


Do you extract the individual elements instead of casting because you want to truncate the vector?
Please add a comment to explain.

Yeah, the intention here is to truncate the vector. Is there a more idiomatic way to do this?

I've added some more documentation for how this particular algorithm is supposed to work in 8d42a82.
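For readers following the truncation question above, here is a minimal sketch of the idea in plain Rust: keep only the first `N` lanes of a wider vector by copying elements one by one. It does not use the libgccjit API, and `truncate_lanes`, the `u16` element type, and the lane counts are assumptions made up for this illustration.

```rust
/// Keep the first `N` lanes of a `W`-lane vector (hypothetical helper).
/// Models "extract individual elements" as a way to drop the unused tail.
fn truncate_lanes<const W: usize, const N: usize>(wide: [u16; W]) -> [u16; N] {
    assert!(N <= W, "cannot truncate to more lanes than the input has");
    // Build the narrower vector element by element.
    std::array::from_fn(|i| wide[i])
}
```

A plain cast would keep the full width, which is why element extraction (or something equivalent) is needed when the result must have fewer lanes than the intermediate vector.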

sadlerap · 2023-11-29T02:42:18Z

Apologies for the delay in getting back to your review. Last week's holidays made things busy for me, and I couldn't get back to this until now.

This test now passes when tested with a patched libgccjit. However, due to [some compiler bugs][1], we can't enable this for non-patched libgccjit yet.

[1]: https://github.com/sadlerap/rustc_codegen_gcc/actions/runs/6820180639/job/18548672444#step:15:4375

Signed-off-by: Andy Sadler <[email protected]>

antoyo · 2023-12-19T18:00:44Z

Thanks a lot for your contribution!

antoyo reviewed Nov 10, 2023

sadlerap force-pushed the impl-generic-arithmetic-pass branch from 1ae81e9 to 1d88c5b Compare November 10, 2023 21:25

sadlerap requested a review from antoyo November 16, 2023 15:05

antoyo reviewed Nov 18, 2023

sadlerap force-pushed the impl-generic-arithmetic-pass branch 2 times, most recently from e4b5828 to bae699f Compare November 22, 2023 04:10

antoyo reviewed Nov 23, 2023

sadlerap force-pushed the impl-generic-arithmetic-pass branch from bae699f to cee07f5 Compare November 29, 2023 02:40

sadlerap added 7 commits November 28, 2023 21:25

implement simd_bswap intrinsic

cc7c9be

Implements lane-local byte swapping through vector shuffles. While this is more setup than non-vector shuffles, this implementation can shuffle multiple integers concurrently.

Signed-off-by: Andy Sadler <[email protected]>
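As a rough illustration of lane-local byte swapping through a shuffle, the sketch below models the idea in plain Rust on byte arrays. It is not the code from this commit and does not call libgccjit; `bswap_shuffle_mask`, `shuffle`, and `simd_bswap_u32x4` are hypothetical names chosen for the example.

```rust
/// Build a shuffle mask that reverses the bytes inside each lane.
/// `lanes` is the element count, `lane_bytes` the element size in bytes.
fn bswap_shuffle_mask(lanes: usize, lane_bytes: usize) -> Vec<usize> {
    (0..lanes)
        .flat_map(|lane| {
            let base = lane * lane_bytes;
            // Indices of this lane's bytes, highest first.
            (0..lane_bytes).rev().map(move |b| base + b)
        })
        .collect()
}

/// Apply a shuffle mask to a byte vector: out[i] = bytes[mask[i]].
fn shuffle(bytes: &[u8], mask: &[usize]) -> Vec<u8> {
    mask.iter().map(|&i| bytes[i]).collect()
}

/// Byte-swap every u32 lane by viewing the vector as bytes and shuffling once.
fn simd_bswap_u32x4(v: [u32; 4]) -> [u32; 4] {
    let bytes: Vec<u8> = v.iter().flat_map(|x| x.to_ne_bytes()).collect();
    let mask = bswap_shuffle_mask(4, 4);
    let shuffled = shuffle(&bytes, &mask);

    let mut out = [0u32; 4];
    for (lane, c) in shuffled.chunks_exact(4).enumerate() {
        out[lane] = u32::from_ne_bytes([c[0], c[1], c[2], c[3]]);
    }
    out
}
```

The mask is computed once for a given vector shape, which is the extra setup the commit message mentions; the single shuffle then swaps all lanes at the same time.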

remove generic-bswap-byte from failing test list

6d13f94

Signed-off-by: Andy Sadler <[email protected]>

fix simd_frem intrinsic implementation

70586a2

The simd intrinsic handler was delegating implementation of `simd_frem` to `Builder::frem`, which wasn't able to handle vector-typed inputs. To fix this, teach this method how to handle vector inputs.

Signed-off-by: Andy Sadler <[email protected]>
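One plausible way to handle vector inputs is to apply the scalar remainder to each lane in turn. The sketch below shows only those semantics in plain Rust; whether the commit takes exactly this per-lane route is an assumption, and `simd_frem_f32x4` is a hypothetical name.

```rust
/// Lane-wise floating-point remainder (hypothetical helper).
/// Rust's `%` on floats keeps the sign of the dividend, like C's `fmod`.
fn simd_frem_f32x4(a: [f32; 4], b: [f32; 4]) -> [f32; 4] {
    let mut out = [0.0f32; 4];
    for i in 0..4 {
        out[i] = a[i] % b[i];
    }
    out
}
```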

impl simd_bitreverse intrinsic

8d42a82

If we're running against a patched libgccjit, use an algorithm similar to what LLVM uses for this intrinsic. Otherwise, fall back to a per-element bitreverse.

Signed-off-by: Andy Sadler <[email protected]>
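The commit message does not spell the algorithm out, so the following is only a sketch of a well-known mask-and-shift bit reversal, written in plain Rust for `u32` lanes. It is an assumption about the general shape of such an algorithm, not a transcription of the commit, and `bitreverse_u32` and `simd_bitreverse_u32x4` are hypothetical names.

```rust
/// Reverse the bits of one u32 by swapping progressively smaller groups.
fn bitreverse_u32(x: u32) -> u32 {
    // Reverse byte order first, then fix up the bits inside each byte.
    let x = x.swap_bytes();
    // Swap the two nibbles of every byte.
    let x = ((x & 0xF0F0_F0F0) >> 4) | ((x & 0x0F0F_0F0F) << 4);
    // Swap adjacent bit pairs.
    let x = ((x & 0xCCCC_CCCC) >> 2) | ((x & 0x3333_3333) << 2);
    // Swap adjacent bits.
    ((x & 0xAAAA_AAAA) >> 1) | ((x & 0x5555_5555) << 1)
}

/// Apply the reversal to every lane of a 4-lane vector.
fn simd_bitreverse_u32x4(v: [u32; 4]) -> [u32; 4] {
    v.map(bitreverse_u32)
}
```

Each step is a pair of masks, shifts, and an OR, all of which apply to whole vectors at once, which is what makes this style attractive compared with reversing each element separately.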

impl simd_ctlz/simd_cttz intrinsic

03e11a2

Signed-off-by: Andy Sadler <[email protected]>

fix simd_neg implementation for ints

3a22132

gcc_not would panic upon encountering a vector type, which is not what we want here.

Signed-off-by: Andy Sadler <[email protected]>

sadlerap force-pushed the impl-generic-arithmetic-pass branch from cee07f5 to 17b2c46 Compare November 29, 2023 03:27

antoyo merged commit db49437 into rust-lang:master Dec 19, 2023
34 checks passed

sadlerap deleted the impl-generic-arithmetic-pass branch December 19, 2023 18:01
