Consenus mode #42

ravyu-jump · 2024-05-28T22:13:24Z

Adds a "consensus mode" flag which, among other things, normalizes error codes between effects. This way, tests do not fail in scenarios that don't break consensus (i.e., different error codes)

Also did some cleanup by removing check_consistency_in_results in multiprocessing_utils.. We have a lot more cleaning up to do

src/test_suite/test_suite.py

mjain-jump · 2024-05-28T22:25:50Z

src/test_suite/test_suite.py

+    if consensus_mode:
+        original_diff_effects_fn = globals.harness_ctx.diff_effect_fn
+
+        def diff_effect_wrapper(a, b):


Does this take into account scenarios where there are multiple fields that need to be normalized? For example, for InstrEffects there's error codes and custom error codes that should both be ignored

Nope, this stuff is hardcoded unfortunately. Didn't see any output in custom_err so I kinda ignored it for now.

Wasn't really sure of how to deal with different effects having different fields to ignore in a consensus mode run. The fact that we want to modify things in place for output also complicates things quite a bit. Perhaps we can define a separate diff_effects_consensus_fn as a part of the interface?

How about we make globals.harness_ctx.result_field_names a list and iterate over it with a for loop within this function? Then for example if someone wanted to find the passing cases if you ignore, for example, CU's, then they could just add that to the list. You can keep the list empty as default, and someone who wants to modify the diff behavior can just add the ignored fields to the list

So I don't exactly just ignore the result fields themselves, just that if they both have error codes. So a generic "ignore fields" list won't apply

ravyu-jump requested review from jumpsiegel and mjain-jump May 28, 2024 22:13

ravyu-jump added 4 commits May 28, 2024 22:15

initial working ignore diff effects mode

160a8af

fix incorrect output for groundtruth target

be19964

rename to consensus mode

de6c138

rename instruction effects

0ac7d2a

ravyu-jump force-pushed the consenus-mode branch from f267846 to 0ac7d2a Compare May 28, 2024 22:16

mjain-jump requested changes May 28, 2024

View reviewed changes

src/test_suite/test_suite.py Outdated Show resolved Hide resolved

src/test_suite/test_suite.py Outdated Show resolved Hide resolved

address PR comments

d2020df

mjain-jump requested changes May 28, 2024

View reviewed changes

mjain-jump approved these changes May 29, 2024

View reviewed changes

ravyu-jump merged commit 710b9ce into main May 29, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consenus mode #42

Consenus mode #42

ravyu-jump commented May 28, 2024 •

edited

Loading

mjain-jump May 28, 2024

ravyu-jump May 28, 2024

mjain-jump May 28, 2024

ravyu-jump May 28, 2024

Consenus mode #42

Consenus mode #42

Conversation

ravyu-jump commented May 28, 2024 • edited Loading

mjain-jump May 28, 2024

Choose a reason for hiding this comment

ravyu-jump May 28, 2024

Choose a reason for hiding this comment

mjain-jump May 28, 2024

Choose a reason for hiding this comment

ravyu-jump May 28, 2024

Choose a reason for hiding this comment

ravyu-jump commented May 28, 2024 •

edited

Loading