
Reject duplicate results when handling efficiencies #65

Merged
6 commits merged into intel:main from reject-duplicate-results on Aug 29, 2024

Conversation

Pennycook (Contributor)

Removing data from a user-supplied DataFrame might impact certain properties of the data (e.g., the order in which applications, platforms, and/or problems appear).

Rather than complicate our implementation with workarounds that might not address every possible use-case, we can simply detect and reject problematic data.

Related issues

This effectively reverts #22. It's an alternative solution to the one proposed in #63.

Proposed changes

  • Reject duplicate (application, platform) pairs when calculating PP and plotting cascades (see the sketch after this list).
  • Prevent pp from sorting implicitly during its groupby operation.
  • Update tests to match new expected behavior.
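
As a rough sketch of the first two bullets (illustrative only: the helper name and error message below are hypothetical, not necessarily the code in this PR), the duplicate check and the unsorted groupby could look something like this in pandas:

import pandas as pd

def _require_unique_results(df: pd.DataFrame) -> None:
    # Reject DataFrames that contain more than one result for the same
    # (application, platform) pair.
    duplicated = df.duplicated(subset=["application", "platform"], keep=False)
    if duplicated.any():
        raise ValueError(
            "Duplicate (application, platform) pairs found; "
            "drop or aggregate duplicates before calling this function."
        )

# Passing sort=False keeps group keys in the order they first appear in the
# user-supplied DataFrame, rather than sorting them implicitly:
# grouped = df.groupby("application", sort=False)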

The intended upshot of these changes is:

  • For simple datasets with no duplicate results, there is no change in behavior.
  • For complex datasets with duplicate results (which may occur after projection), we throw a ValueError.
  • All of our calculation and plotting functions should now respect the order of the data given to them, so the user regains the ability to control the order of applications and platforms in the legend simply by sorting their data.

In my own offline testing of complex P3 workflows, I've found that I need to insert an additional line to prepare data the way I typically want it to be plotted:

import p3.metrics
import p3.plot

eff_df = p3.metrics.application_efficiency(projected_df, foms="higher")
eff_df = eff_df.sort_values("app eff").drop_duplicates(["application", "platform"], keep="last").sort_index()
cascade = p3.plot.cascade(eff_df)

I don't think this is too bad, and it only shows up in complicated cases. If we wanted to simplify this workflow, we could consider introducing something like:

eff_df = p3.metrics.application_efficiency(projected_df, foms="higher", keep="best") # or keep="all", keep="latest"
cascade = p3.plot.cascade(eff_df)

...but I'd want to explore that separately, to make sure that we design and test it properly.

Previously, the validity of the efficiency column was being checked by
the matplotlib and pgfplots backends instead of by the dispatch code.

Centralizing the checks will ensure they stay in sync.

Signed-off-by: John Pennycook <[email protected]>
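
To make the centralization concrete, here is a minimal sketch of the idea (the exact check, the "arch eff" column name, and the backend helper names are assumptions, not the actual p3 code): the dispatch function validates the efficiency column once, before handing off to either backend.

def cascade(df, backend="matplotlib", **kwargs):
    # Validate once here, in the dispatch code, rather than separately in
    # the matplotlib and pgfplots backends.
    efficiency_columns = [c for c in ("app eff", "arch eff") if c in df.columns]
    if len(efficiency_columns) != 1:
        raise ValueError("Expected exactly one efficiency column.")
    if backend == "matplotlib":
        return _cascade_matplotlib(df, efficiency_columns[0], **kwargs)
    if backend == "pgfplots":
        return _cascade_pgfplots(df, efficiency_columns[0], **kwargs)
    raise ValueError(f"Unknown backend: {backend!r}")
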
Our previous attempt at solving this problem (see intel#22) had unexpected
knock-on effects, since removing data from a user-supplied DataFrame
might impact certain properties of the data (e.g., the order in which
applications, platforms, and/or problems appear).

Rather than complicate our implementation with workarounds that might
not address every possible use-case, we can simply detect and reject
problematic data.

This change slightly complicates the process of working with large data,
but ensures that users are always in control of which data is plotted.

Signed-off-by: John Pennycook <[email protected]>
Since we're changing the way that duplicates are handled by PP, the
newest version fails the original test.

Signed-off-by: John Pennycook <[email protected]>
By default, groupby sorts the DataFrame. This leads to weird reordering
effects when pp is used in conjunction with cascade plots and navcharts.

Signed-off-by: John Pennycook <[email protected]>
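
For context on this commit, a minimal pandas sketch (not the library's code) of the behaviour in question: groupby sorts the group keys by default, so output order can differ from input order unless sort=False is passed.

import pandas as pd

df = pd.DataFrame({"platform": ["gpu", "cpu", "fpga"], "value": [1, 2, 3]})

# Default behaviour: group keys come back sorted alphabetically.
print(df.groupby("platform").sum().index.tolist())              # ['cpu', 'fpga', 'gpu']

# sort=False preserves the order of first appearance in the input.
print(df.groupby("platform", sort=False).sum().index.tolist())  # ['gpu', 'cpu', 'fpga']
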
The previous "expected" test result had actually been chosen based on
the empirical behavior of the library. If we expect the output DataFrame
to remain unsorted, we should test for that.

Signed-off-by: John Pennycook <[email protected]>
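
A sketch of what testing for the unsorted behaviour might look like (the fixture name and the exact call to pp are illustrative, not the test added in this PR):

import p3.metrics

def test_pp_preserves_input_order(unsorted_efficiency_df):
    # The output should follow the platform order of the input DataFrame,
    # rather than being sorted alphabetically.
    result = p3.metrics.pp(unsorted_efficiency_df)
    assert list(result["platform"]) == list(unsorted_efficiency_df["platform"].unique())
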
@Pennycook added the "bug" and "enhancement" labels on Aug 22, 2024
@swright87 (Collaborator) left a comment:

This all looks reasonable to me, and I think it's good that it also consolidates some duplicated backend code into a single place.

I actually like the idea of a "keep=best"-type argument as well. I would be in favour of exploring that in a separate issue, with error, best, latest, etc. as options.

@laserkelvin (Contributor) left a comment:

LGTM - not sure if you wanted to explain/document why the ValueError is raised when pairs are not unique

p3/metrics/_pp.py (review comments resolved; outdated)
p3/plot/_cascade.py (review comments resolved; outdated)
@Pennycook merged commit ef826b2 into intel:main on Aug 29, 2024
7 checks passed
@Pennycook deleted the reject-duplicate-results branch on August 29, 2024 at 18:32