perf: optimize proof generation #509

chokobole · 2024-08-07T11:42:54Z

Description

This PR optimizes parallel loops. (Including refactoring + bugfix)

TomTaehoonKim · 2024-08-08T06:49:45Z

c69c990: parallel loops -> loops?

tachyon/base/parallelize.h

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h

tachyon/math/polynomials/univariate/radix2_evaluation_domain.h

tachyon/zk/air/plonky3/base/two_adic_multiplicative_coset.h

chokobole · 2024-08-08T07:56:42Z

c69c990: parallel loops -> loops?

I said "merge parallel loops" because F::GetSuccessivePowers() uses parallel loops independently. I'll add more details in commit body.

GideokKim

LGTM

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h

ashjeong

79c9de0
Please change the commit body. I’m not confident my change accurately depicts the changes. Please check that it does!

Removes code duplicates and optimizes potential parallelization.
If the input of BatchNormalize() is large enough, vector creation is parallelized. Else, the
operation is run serially, reducing the number of allocations.

4669aec
Change first line of body to “merge parallel loops since F::GetSuccessivePowers() uses parallel loops independently.” ("use" -> "uses")

715db8b
Please change the commit body as such:

Reduces the number of thread joins and heap allocations.

Additionally fixes bugs in GetSelectorsOnCoset() replacing
chunk_size with len.

fb2ea95
Commit title-> “prevent possible overflow error”

0018824
Commit title-> “fix ScalarMul result of multiplying by zero”

tachyon/math/polynomials/univariate/naive_batch_fft.h

TomTaehoonKim

LGTM

ashjeong

LGTM

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h

tachyon/math/elliptic_curves/msm/algorithms/pippenger/pippenger_adapter.h

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h

dongchangYoo

LGTM

tachyon/math/polynomials/univariate/radix2_evaluation_domain.h

tachyon/math/polynomials/univariate/two_adic_subgroup.h

tachyon/math/polynomials/univariate/naive_batch_fft.h

- use `std::move()` where possible - release intermediate vector immediately - reserve `evals` size

…` strategy

Removes code duplication and optimizes potential parallelization. Previously, when running serially, allocations occurred but were not used. Additionally, when running with parallelization, the allocation was not parallelized. Now, if the input to `BatchNormalize()` is large enough, vector creation is parallelized properly and proper allocation usage is ensured.

- merge parallel loops since `F::GetSuccessivePowers()` uses parallel loops independently. - use `std::move()` where possible

This is needed to construct `BigInt` with `Eigen::Index` from macOS.

Reduces the number of thread joins and heap allocations. Additionally, fixes bugs in `GetSelectorsOnCoset()` replacing `chunk_size` with `len`.

batzor

LGTM

chokobole requested review from batzor, dongchangYoo, TomTaehoonKim, GideokKim and ashjeong August 7, 2024 11:42

chokobole force-pushed the perf/optimize-parallel-loop branch from aeb4754 to 446b215 Compare August 7, 2024 11:56

TomTaehoonKim reviewed Aug 8, 2024

View reviewed changes

tachyon/base/parallelize.h Outdated Show resolved Hide resolved

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h Outdated Show resolved Hide resolved

tachyon/math/polynomials/univariate/radix2_evaluation_domain.h Show resolved Hide resolved

GideokKim reviewed Aug 8, 2024

View reviewed changes

tachyon/zk/air/plonky3/base/two_adic_multiplicative_coset.h Outdated Show resolved Hide resolved

GideokKim reviewed Aug 8, 2024

View reviewed changes

tachyon/zk/air/plonky3/base/two_adic_multiplicative_coset.h Outdated Show resolved Hide resolved

tachyon/zk/air/plonky3/base/two_adic_multiplicative_coset.h Outdated Show resolved Hide resolved

chokobole force-pushed the perf/optimize-parallel-loop branch from 4807e85 to 0018824 Compare August 8, 2024 08:24

GideokKim approved these changes Aug 8, 2024

View reviewed changes

TomTaehoonKim reviewed Aug 8, 2024

View reviewed changes

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h Show resolved Hide resolved

ashjeong reviewed Aug 8, 2024

View reviewed changes

tachyon/math/polynomials/univariate/naive_batch_fft.h Outdated Show resolved Hide resolved

TomTaehoonKim approved these changes Aug 8, 2024

View reviewed changes

chokobole force-pushed the perf/optimize-parallel-loop branch 2 times, most recently from 379563f to c34c360 Compare August 8, 2024 09:36

ashjeong approved these changes Aug 8, 2024

View reviewed changes

batzor reviewed Aug 8, 2024

View reviewed changes

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h Outdated Show resolved Hide resolved

batzor reviewed Aug 8, 2024

View reviewed changes

tachyon/math/elliptic_curves/msm/algorithms/pippenger/pippenger_adapter.h Outdated Show resolved Hide resolved

batzor reviewed Aug 8, 2024

View reviewed changes

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h Show resolved Hide resolved

batzor reviewed Aug 8, 2024

View reviewed changes

tachyon/crypto/sumcheck/multilinear/sumcheck_verifier.h Outdated Show resolved Hide resolved

chokobole force-pushed the perf/optimize-parallel-loop branch from c34c360 to d2b6169 Compare August 9, 2024 00:37

dongchangYoo approved these changes Aug 9, 2024

View reviewed changes

batzor reviewed Aug 9, 2024

View reviewed changes

tachyon/math/polynomials/univariate/radix2_evaluation_domain.h Show resolved Hide resolved

tachyon/math/polynomials/univariate/two_adic_subgroup.h Show resolved Hide resolved

tachyon/math/polynomials/univariate/naive_batch_fft.h Show resolved Hide resolved

chokobole force-pushed the perf/optimize-parallel-loop branch from d2b6169 to bd0aee8 Compare August 9, 2024 03:19

chokobole added 3 commits August 9, 2024 12:19

perf(zk): avoid montgomery conversion by increment

e38b95a

fix(base): fix typo

13ac717

perf(crypto): optimize InterpolateUniPoly()

8eea789

- use `std::move()` where possible - release intermediate vector immediately - reserve `evals` size

chokobole added 8 commits August 9, 2024 12:19

fix(math): restore thread nums after MSM with `kParallelWidnowAndTerm…

3369de7

…` strategy

refac: use base::ParallelizeXXX with size

aa1a5fd

perf(math): optimize NaiveBatchFFT::FFTBatch()

2c4d800

- merge parallel loops since `F::GetSuccessivePowers()` uses parallel loops independently. - use `std::move()` where possible

feat(math): enable constructing BigInt with any integral values

39cf066

This is needed to construct `BigInt` with `Eigen::Index` from macOS.

perf: merge GetSuccessivePowers() into existing parallel loop

628bfe1

Reduces the number of thread joins and heap allocations. Additionally, fixes bugs in `GetSelectorsOnCoset()` replacing `chunk_size` with `len`.

fix: prevent possible overflow error

78f0c1b

fix(math): fix ScalarMul result of multiplying by zero

6b5b8a7

batzor approved these changes Aug 9, 2024

View reviewed changes

chokobole force-pushed the perf/optimize-parallel-loop branch from bd0aee8 to 6b5b8a7 Compare August 9, 2024 03:26

chokobole merged commit e3dc4dd into main Aug 9, 2024
7 checks passed

chokobole deleted the perf/optimize-parallel-loop branch August 9, 2024 03:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: optimize proof generation #509

perf: optimize proof generation #509

chokobole commented Aug 7, 2024 •

edited by ashjeong

Loading

TomTaehoonKim commented Aug 8, 2024

chokobole commented Aug 8, 2024 •

edited

Loading

GideokKim left a comment

ashjeong left a comment •

edited

Loading

TomTaehoonKim left a comment

ashjeong left a comment

dongchangYoo left a comment

batzor left a comment

perf: optimize proof generation #509

perf: optimize proof generation #509

Conversation

chokobole commented Aug 7, 2024 • edited by ashjeong Loading

Description

TomTaehoonKim commented Aug 8, 2024

chokobole commented Aug 8, 2024 • edited Loading

GideokKim left a comment

Choose a reason for hiding this comment

ashjeong left a comment • edited Loading

Choose a reason for hiding this comment

TomTaehoonKim left a comment

Choose a reason for hiding this comment

ashjeong left a comment

Choose a reason for hiding this comment

dongchangYoo left a comment

Choose a reason for hiding this comment

batzor left a comment

Choose a reason for hiding this comment

chokobole commented Aug 7, 2024 •

edited by ashjeong

Loading

chokobole commented Aug 8, 2024 •

edited

Loading

ashjeong left a comment •

edited

Loading