ML-DSA: AVX2 target feature #642

jschneider-bensch · 2024-10-21T11:54:54Z

This PR applies the same restructuring to ML-DSA that #636 applies to ML-KEM: once we know we're running on an AVX2 machine, we manually enable AVX2-specific optimizations using `#[target_feature(enable = "avx2")]` and then inline most functions further down the call tree.
In terms of performance, this should get us close to compiling the whole crate with `RUSTFLAGS="-C target-feature=+avx2"`.
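
For readers unfamiliar with the pattern, here is a minimal sketch of the general technique, not the code in this PR. All names (`add`, `add_portable`, `avx2::add_inner`) are hypothetical and chosen only for illustration. It assumes a runtime check via `is_x86_feature_detected!`, one entry point carrying `#[target_feature(enable = "avx2")]`, and helpers marked `#[inline(always)]` so that they are compiled in the AVX2-enabled context of their caller.

```rust
// Illustrative sketch only; none of these names come from libcrux.

/// Portable fallback used when AVX2 is not available.
fn add_portable(a: &mut [i32; 8], b: &[i32; 8]) {
    for i in 0..8 {
        a[i] = a[i].wrapping_add(b[i]);
    }
}

#[cfg(target_arch = "x86_64")]
mod avx2 {
    use core::arch::x86_64::*;

    // Helper further down the call tree. It carries no `target_feature`
    // attribute of its own; `#[inline(always)]` lets it be inlined into
    // the AVX2-enabled entry point below.
    #[inline(always)]
    unsafe fn add_inner(a: &mut [i32; 8], b: &[i32; 8]) {
        unsafe {
            let va = _mm256_loadu_si256(a.as_ptr() as *const __m256i);
            let vb = _mm256_loadu_si256(b.as_ptr() as *const __m256i);
            let sum = _mm256_add_epi32(va, vb);
            _mm256_storeu_si256(a.as_mut_ptr() as *mut __m256i, sum);
        }
    }

    // Single entry point with AVX2 enabled. Callers must ensure the CPU
    // supports AVX2 before calling this.
    #[target_feature(enable = "avx2")]
    pub unsafe fn add(a: &mut [i32; 8], b: &[i32; 8]) {
        unsafe { add_inner(a, b) }
    }
}

/// Public API: pick the AVX2 path at runtime when the CPU supports it.
pub fn add(a: &mut [i32; 8], b: &[i32; 8]) {
    #[cfg(target_arch = "x86_64")]
    {
        if std::arch::is_x86_feature_detected!("avx2") {
            // SAFETY: AVX2 support was checked on the line above.
            unsafe { avx2::add(a, b) };
            return;
        }
    }
    add_portable(a, b)
}
```

The helper deliberately has no `#[target_feature]` attribute because rustc has historically rejected combining it with `#[inline(always)]` on the same function; enabling the feature once at the entry point and inlining everything below it sidesteps that restriction.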

jschneider-bensch · 2024-11-06T14:00:49Z

~~The F* lax checking changes regressed performance again. Looking into why this happened.~~

It seems 0703c5ba349bf587e1cfb3f9628fc61693e61119 was the commit that originally regressed performance before rebasing onto the lax checking branch.

jschneider-bensch · 2024-11-07T14:21:56Z

I have another version of this branch at https://github.com/cryspen/libcrux/tree/jonas/ml-dsa-target-feature-backup, taken before the rebase onto the laxing changes. There, reverting the above commit (which contains the SHA-3 changes) fully restores performance but leads to a stack overflow. Reverting the commit on this branch does not seem to help performance.

@franziskuskiefer @karthikbhargavan Shall we try and get this PR in with somewhat degraded performance (e.g. for me the difference between the overflowing performant code and the laxing, non-overflowing code is ~10x), and look into fully restoring performance in a follow-up?

franziskuskiefer

Let's get this in and then investigate the performance regression.

Base automatically changed from jonas/ml-kem-target-feature to main October 21, 2024 21:19

franziskuskiefer mentioned this pull request Nov 1, 2024

Lax Checking for ML-DSA #646

Merged

jschneider-bensch added 12 commits November 4, 2024 14:58

Enable AVX2 target feature

1ec5a9f

Inlining to help AVX2 optimization

0031d3c

Missing module

4bc6554

Attempt to reduce stack size

f946f53

Format

a86ce00

ACVP uninlined inner functions

23120b2

Remove Zeta arrays

30c7de7

Changes cherry-picked from `07084ab4` and `6022f6e4`

SHA-3 AVX2 target feature

0703c5b

Changes cherry-picked from `a87318d8`

Inlining + target feature changes around inverse NTT

5d85370

Changes cherry-picked from `a87318d8`

Header-only extraction update

6c8fdc3

Update C extraction

51e2d6d

Fix paths

61f72fd

jschneider-bensch force-pushed the jonas/ml-dsa-target-feature branch from 3c5b96d to 61f72fd Compare November 4, 2024 14:12

jschneider-bensch changed the base branch from main to ml-dsa-lax November 4, 2024 14:13

Merge branch 'ml-dsa-lax' into jonas/ml-dsa-target-feature

5c005e0

Base automatically changed from ml-dsa-lax to main November 5, 2024 11:42

jschneider-bensch and others added 8 commits November 5, 2024 13:19

Merge branch 'main' into jonas/ml-dsa-target-feature

5f88703

Missing opaque_types in hash_functions

654d836

Make trait impl functions into wrappers

a19752d

Don't use trait methods

837d70f

Update F*

873ccf8

Guard target_feature to cfg(not(hax))

b1ad8bb

Update F*

0dccff5

Use local functions in favor of macros to help F*

11b5102

karthikbhargavan and others added 2 commits November 8, 2024 08:24

Merge branch 'main' into jonas/ml-dsa-target-feature

66059e5

Revert "SHA-3 AVX2 target feature"

8f41090

This reverts commit 0703c5b.

franziskuskiefer added 2 commits November 8, 2024 08:56

inline ntt

a31e411

update mlkem C code

1cefb79

franziskuskiefer approved these changes Nov 8, 2024

franziskuskiefer enabled auto-merge November 8, 2024 10:27

franziskuskiefer added this pull request to the merge queue Nov 8, 2024

Merged via the queue into main with commit d4b585d Nov 8, 2024
52 checks passed

franziskuskiefer deleted the jonas/ml-dsa-target-feature branch November 8, 2024 11:37
