Update coverage file in BabelStream case study #51

Pennycook · 2024-07-18T18:43:14Z

We forgot to update the BabelStream case study when we changed the coverage schema. Running the example as it was gave an error due to schema validation.

This commit is a best-effort attempt to convert the old coverage file into the new format. It is "best-effort" because:

The old format did not include filenames, but the new format requires them. Since the case study already uses anonymized platforms and languages, the simplest solution here was to generate names like "file1" and "file2".
The old format stored lines as regions containing the number of "real" lines of code in the range (start, end), rather than storing the specific lines of code that were used. However, since all the data was generated by the same version of Code Base Investigator parsing the same files, all the regions line up. The simplest solution therefore was to pretend that the region represented a contiguous run of lines. The information about the number of comments and amount of whitespace in a region is destroyed by this transformation, but it isn't necessary to compute divergence.

Related issues

Closes #50.

Proposed changes

Replace outdated coverage.csv with one using coverage strings that conform to 0.3.0 schema.

We forgot to update the BabelStream case study when we changed the coverage schema. Running the example as it was gave an error due to schema validation. This commit is a best-effort attempt to convert the old coverage file into the new format. It is "best-effort" because: - The old format did not include filenames, but the new format requires them. Since the case study already uses anonymized platforms and languages, the simplest solution here was to generate names like "file1" and "file2". - The old format stored lines as regions containing the number of "real" lines of code in the range (start, end), rather than storing the specific lines of code that were used. However, since all the data was generated by the same version of Code Base Investigator parsing the same files, all the regions line up. The simplest solution therefore was to pretend that the region represented a contiguous run of lines. The information about the number of comments and amount of whitespace in a region is destroyed by this transformation, but it isn't necessary to compute divergence. Signed-off-by: John Pennycook <[email protected]>

- Switches to furo theme (#47) - Updates examples to new style (#51, #67) - Adds new examples (#54, #55, #58) - Marks the documentation unstable (#72) Signed-off-by: John Pennycook <[email protected]>

Pennycook added bug Something isn't working documentation Improvements or additions to documentation labels Jul 18, 2024

Pennycook added this to the 1.0.0 milestone Jul 18, 2024

Pennycook requested review from swright87 and laserkelvin July 18, 2024 18:43

Pennycook force-pushed the update-babelstream-coverage branch from 452cc26 to f737bfb Compare July 18, 2024 18:48

laserkelvin approved these changes Jul 19, 2024

View reviewed changes

Pennycook merged commit c8058fa into intel:main Jul 19, 2024
3 checks passed

Pennycook deleted the update-babelstream-coverage branch July 19, 2024 19:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update coverage file in BabelStream case study #51

Update coverage file in BabelStream case study #51

Pennycook commented Jul 18, 2024 •

edited

Loading

Update coverage file in BabelStream case study #51

Update coverage file in BabelStream case study #51

Conversation

Pennycook commented Jul 18, 2024 • edited Loading

Related issues

Proposed changes

Pennycook commented Jul 18, 2024 •

edited

Loading