(WIP) Refactor row major matrix #624

mcalancea · 2024-11-22T14:14:15Z

No description provided.

hero78119 · 2024-11-22T15:32:02Z

This is a great effort! As some moment I am also curious what would be the impact if we remove MaybeUint feature and use raw vector as we only have one implementation.

I just do a very quick tried on this branch against master branch on riscv_add benchmark with command on remote ceno benchmark machine, with command

### on master branch
cargo bench --bench riscv_add --package ceno_zkvm -- --save-baseline baseline

### on this branch
cargo bench --bench riscv_add --package ceno_zkvm -- --baseline baseline

and the result turns out to be ~+6% slower on e2e latency.

## master branch
add_op_20/prove_add/prove_add_log2_20
                        time:   [1.5172 s 1.5615 s 1.6110 s]

### this branch
add_op_20/prove_add/prove_add_log2_20
                        time:   [1.6454 s 1.6695 s 1.6898 s]
                        change: [+4.1692% +6.3642% +8.4701%] (p = 0.00 < 0.05)

I believed with fibonacchi example it might be even much slower. So it's still nessesary to this feature for keeping high performance.

hero78119 · 2024-11-22T15:35:51Z

If we just talked about capture this unssignment error, I think we can achieve it from different path: in unittest we support init vector into 2 different default value with same execution trance, and then compare 2 witnesses which should be identical. An unequality indicate there are some unassigment bug. And with that, we can capture the unassignment error in early stage.

mcalancea · 2024-11-22T15:52:51Z

@hero78119

Thank you! This is still work in progress; I'm examining some ways to make it faster while keeping it clean/safe. My bench results are a bit better for some reason:

add_op_20/prove_add/prove_add_log2_20
                        time:   [4.7420 s 4.7740 s 4.8031 s]
                        change: [+1.3563% +2.2286% +3.0911%] (p = 0.00 < 0.05)
                        Performance has regressed.
add_op_21/prove_add/prove_add_log2_21
                        time:   [9.9516 s 10.189 s 10.450 s]
                        change: [-1.7437% +0.5352% +2.8403%] (p = 0.70 > 0.05)
                        No change in performance detected.

What regression % (evaluated at yours) would you consider acceptable?

naure

Looking nice so far 👍

naure · 2024-11-22T15:46:22Z

ceno_zkvm/src/witness.rs

-    pub fn num_instances(&self) -> usize {
-        self.values.len() / self.num_col - self.num_padding_rows
+    pub fn len(&self) -> usize {
+        self.values.len()


Confusing name. Maybe total_len, num_cells, …?

naure · 2024-11-22T15:52:16Z

ceno_zkvm/src/witness.rs

+        let start = Instant::now();
+        let padding_row = match self.padding_strategy {
+            InstancePaddingStrategy::RepeatLast => {
+                self.values[self.values.len() - self.num_col..].to_vec()


What if empty?

There was a special case if steps.is_empty() somewhere:

InstancePaddingStrategy::RepeatLast if steps.is_empty() => { tracing::debug!("No {} steps to repeat, using zero padding", Self::name()); vec![MaybeUninit::new(E::BaseField::ZERO); num_witin] }

although that was not completely correct either; better than crashing.

matthiasgoergens · 2024-11-25T03:55:19Z

Could you please add to the PR description a very brief overview over what we are doing, and more importantly why?

(The what can be really brief, because the text of the PR answers that question in detail.)

Thanks!

matthiasgoergens · 2024-11-25T04:04:03Z

This is a great effort! As some moment I am also curious what would be the impact if we remove MaybeUint feature and use raw vector as we only have one implementation.

Yes, I have been rather suspicious of MaybUninit. It looks like a bit of a minefield, and I'm not sure it's actually worth it.

hero78119 · 2024-11-25T04:28:05Z

... I'm examining some ways to make it faster while keeping it clean/safe. My bench results are a bit better for some reason:
add_op_20/prove_add/prove_add_log2_20
                        time:   [4.7420 s 4.7740 s 4.8031 s]
                        change: [+1.3563% +2.2286% +3.0911%] (p = 0.00 < 0.05)
                        Performance has regressed.
...
What regression % (evaluated at yours) would you consider acceptable?

I noticed the different with yours vs remote ceno benchmark server probably on environment number of cores.
In multicore environment (16x2 cores), removing MaybeUinit the regression will be more significant, e.g. + ~6% regressed.

To me any regression of % is somehow unacceptable.

So I would suggest to go from another way: in unittest initialized with 2 different default value with same witness, then check the witness polynomial equality. This help us to identify the problem while not sacrificing performance.

matthiasgoergens · 2024-11-25T06:52:16Z

If you want some inspiration for how to do this kind of change, you might want to have a look at how plonky3 changed the organisation of its data compared to plonky2. (It might be a bit much too read, though.)

naure · 2024-11-25T09:54:27Z

@hero78119 There is a draft of the approach based on testing here: #597

naure · 2024-11-25T10:05:20Z

To get back to optimal performance without unsafe types, there could be another constructor that accepts a vector or an iterator. Callers can built their Vec with e.g. flatten and collect.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(WIP) Refactor row major matrix #624

(WIP) Refactor row major matrix #624

mcalancea commented Nov 22, 2024

hero78119 commented Nov 22, 2024 •

edited

Loading

hero78119 commented Nov 22, 2024

mcalancea commented Nov 22, 2024 •

edited

Loading

naure left a comment

naure Nov 22, 2024

naure Nov 22, 2024

matthiasgoergens commented Nov 25, 2024

matthiasgoergens commented Nov 25, 2024

hero78119 commented Nov 25, 2024 •

edited

Loading

matthiasgoergens commented Nov 25, 2024

naure commented Nov 25, 2024

naure commented Nov 25, 2024

(WIP) Refactor row major matrix #624

Are you sure you want to change the base?

(WIP) Refactor row major matrix #624

Conversation

mcalancea commented Nov 22, 2024

hero78119 commented Nov 22, 2024 • edited Loading

hero78119 commented Nov 22, 2024

mcalancea commented Nov 22, 2024 • edited Loading

naure left a comment

Choose a reason for hiding this comment

naure Nov 22, 2024

Choose a reason for hiding this comment

naure Nov 22, 2024

Choose a reason for hiding this comment

matthiasgoergens commented Nov 25, 2024

matthiasgoergens commented Nov 25, 2024

hero78119 commented Nov 25, 2024 • edited Loading

matthiasgoergens commented Nov 25, 2024

naure commented Nov 25, 2024

naure commented Nov 25, 2024

hero78119 commented Nov 22, 2024 •

edited

Loading

mcalancea commented Nov 22, 2024 •

edited

Loading

hero78119 commented Nov 25, 2024 •

edited

Loading