Starting tagging functionality #65

pearce8 · 2023-12-01T14:34:13Z

Set of tags for benchpark: A central place to keep all the possible tag values.
Need to be able to read tags in application.py, whether in benchpark/repo, or in the Ramble built-ins.
Need functionality to report on the overall tags covered by experiments.
Need functionality to report on the overall tags for each suite.
When adding new tags to experiments, a check on the PR should verify that each tag exists and encourage users to pick from existing tags. If a new tag is needed, propose adding it to benchpark_tags.yaml before starting to use it in experiments.
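
A minimal sketch of the coverage report described above, assuming each experiment's tags have already been collected into a plain mapping. The experiment names, tag values, and function names are hypothetical placeholders, not taken from benchpark.

```python
from collections import Counter

# Hypothetical data: experiment name -> tags attached to it.
EXPERIMENT_TAGS = {
    "example-app-a/openmp": ["memory-bandwidth", "single-node"],
    "example-app-b/cuda": ["sparse-linear-algebra", "multi-node"],
    "example-app-c/openmp": ["memory-bandwidth", "single-node"],
}


def tag_coverage(experiment_tags):
    """Count how many experiments carry each tag."""
    counts = Counter()
    for tags in experiment_tags.values():
        counts.update(set(tags))  # count a tag at most once per experiment
    return counts


def uncovered_tags(known_tags, experiment_tags):
    """Return known tags that no experiment uses, sorted for stable output."""
    return sorted(set(known_tags) - set(tag_coverage(experiment_tags)))
```

Calling tag_coverage(EXPERIMENT_TAGS) gives the per-tag counts, and uncovered_tags(...) lists central tag values that no experiment exercises yet. A per-suite report could reuse the same functions on a subset of the mapping.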

scheibelp · 2023-12-02T00:03:25Z

I think this should be automatically constructed from ramble attributes --tags --all. Note that this satisfies:

> Need to be able to read tags in application.py, whether in benchpark/repo, or in the Ramble built-ins.

as long as the ramble config includes both of these repos.

> Need functionality to report on the overall tags for each suite.

what is a suite? Is it just another tag that happens to be used for organization? For example I don't see a designated suite attribute for Ramble application objects.

> If a new tag is needed, propose to add it to the benchpark_tags.yaml before starting to use it in experiments.

FWIW, one of our CI checks could be to auto-collect the tags and compare them with benchpark_tags.yaml, and fail automatically if they don't match. Then any PR would have to update benchpark_tags.yaml as part of the diff (we can likewise provide a command to do that update automatically).
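
A rough sketch of such a CI check, assuming the tags used by applications and the entries of benchpark_tags.yaml have both already been loaded into Python collections. The function names are hypothetical, and how the tags are actually collected (for example from Ramble) is left out.

```python
import sys


def compare_tags(collected, declared):
    """Return (undeclared, unused) tag lists.

    undeclared: tags used by applications but absent from the tag file.
    unused: tags listed in the tag file that nothing uses.
    """
    collected, declared = set(collected), set(declared)
    return sorted(collected - declared), sorted(declared - collected)


def check(collected, declared):
    """Print any mismatches and return a process exit code (0 = match)."""
    undeclared, unused = compare_tags(collected, declared)
    for tag in undeclared:
        print(f"tag used but not declared: {tag}", file=sys.stderr)
    for tag in unused:
        print(f"tag declared but never used: {tag}", file=sys.stderr)
    # A nonzero return value lets the CI job fail when the two sets differ.
    return 1 if (undeclared or unused) else 0
```

A CI step could pass the result of check(...) to sys.exit. The same comparison could also drive the proposed update command, which would rewrite the tag file from the collected set instead of failing.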

bin/benchpark_tags.yaml

becker33 · 2023-12-05T20:51:02Z

Some tags seem to be per-application and some seem to be per-experiment (since applications can stress the system in different ways with different arguments). Should we check with the Ramble developers about adding a tagging capability to the ramble.yaml syntax?

These tags should then filter into benchpark list so that users can select tags for attributes of the system they want to test and get a list of potentially relevant experiments to choose from. Maybe benchpark list -t memory-bandwidth
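
One way the tag filter could work is sketched below, assuming experiments are available as a mapping from name to tags. The function and the experiment data are hypothetical and are not benchpark's implementation of benchpark list.

```python
def experiments_with_tags(experiment_tags, wanted):
    """Return names of experiments that carry every tag in `wanted`."""
    wanted = set(wanted)
    return sorted(
        name for name, tags in experiment_tags.items() if wanted <= set(tags)
    )


# Hypothetical experiments and tags, for illustration only.
EXAMPLE_EXPERIMENTS = {
    "example-app-a/openmp": ["memory-bandwidth", "single-node"],
    "example-app-b/cuda": ["compute-bound", "multi-node"],
    "example-app-c/mpi": ["memory-bandwidth", "multi-node"],
}
```

Here experiments_with_tags(EXAMPLE_EXPERIMENTS, ["memory-bandwidth"]) would select the first and third entries. Requiring every requested tag is a design choice; matching any one of them is an equally plausible reading of the proposal.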

Maybe we should publish (potentially outside of benchpark?) a taxonomy of benchmarking types which could be used as the set of available tags in benchpark? I'm not sure whether this should be coupled into benchpark or treated as an external dependency. I guess it's easier to start internal and spin out if necessary.

becker33 · 2023-12-05T20:53:36Z

@scheibelp I think that we may want our list to be separate from the list of tags used in Ramble, because I think there are tags relevant to experiments that are not applicable at the application level.

I agree that we should have a CI check that all tags in our repo are in the list of tags, but I don't know that we want to necessarily care about every tag that ramble has in upstream packages. If someone tags an application as "google" because they developed it, we probably don't care from a benchpark perspective.
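
The narrower check described here could look roughly like the following: only tags from applications in benchpark's own repo are validated against the list, and tags on upstream Ramble applications are ignored. The record layout (a "source" field per application) and all names are assumptions for illustration.

```python
def undeclared_repo_tags(applications, declared, repo="benchpark"):
    """Return {application name: sorted undeclared tags} for `repo` apps only."""
    declared = set(declared)
    problems = {}
    for app in applications:
        if app["source"] != repo:
            continue  # upstream tags (e.g. "google") are deliberately ignored
        extra = sorted(set(app["tags"]) - declared)
        if extra:
            problems[app["name"]] = extra
    return problems


# Hypothetical application records, for illustration only.
EXAMPLE_APPS = [
    {"name": "app-a", "source": "benchpark", "tags": ["mpi", "new-tag"]},
    {"name": "app-b", "source": "ramble-builtin", "tags": ["google"]},
]
```

With a declared list of ["mpi"], only app-a's "new-tag" is reported; the upstream "google" tag passes unnoticed, which is the behavior argued for above.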

scheibelp · 2023-12-06T01:14:25Z

> I think that we may want our list to be separate from the list of tags used in Ramble, because I think there are tags relevant to experiments that are not applicable at the application level.

Agree, I think that was a possible reason for expanding the YAML format from a list of tags to a dict, which was collapsed in #65 (comment) (I should have made that a separate comment vs. an annotation).

> I agree that we should have a CI check that all tags in our repo are in the list of tags, but I don't know that we want to necessarily care about every tag that ramble has in upstream packages. If someone tags an application as "google" because they developed it, we probably don't care from a benchpark perspective

Does this just mean that we might not want to display all tags that exist in all application.py's (and you think it makes sense to report a problem if a user mentions a tag that is not in an application.py, or possibly in a "group" as proposed above in this comment)? Without that grouping, I think tags outside of the set exported by applications aren't useful, unless you had some notion of another reason we could use them.

alecbcs

Looks good to me!

Starting tagging functionality

00724e2

scheibelp self-assigned this Dec 1, 2023

scheibelp reviewed Dec 2, 2023

bin/benchpark_tags.yaml (outdated)

pearce8 added 3 commits December 5, 2023 14:18

Update benchpark_tags.yaml

47ab9d9

Merge branch 'develop' into feature/tagging

292d44b

Update benchpark_tags.yaml

ccd13e5

pearce8 added 2 commits December 18, 2023 17:46

copyright

08fe3b1

Merge branch 'develop' into feature/tagging

fb938e1

pearce8 added the feature label (New feature or request) Dec 19, 2023

pearce8 added 4 commits December 19, 2023 03:31

trying to fix lint

03e2f71

Merge branch 'develop' into feature/tagging

60904ed

Merge branch 'develop' into feature/tagging

87b3d50

Adding tags

70b35a5

github-actions bot added the application label (New or modified application) Jan 9, 2024

pearce8 added 10 commits January 9, 2024 09:14

Adding tags

84c3770

Adding tags

d79d247

adding tags

d3d3ea5

adding tags

12838a2

adding tags

3a0ca27

Expanding benchmark tags options

5dcd37f

Update tags

41ca661

Merge branch 'develop' into feature/tagging

bc04a8e

Merge branch 'develop' into feature/tagging

2c78ce0

Merge branch 'develop' into feature/tagging

b5aa5e8

pearce8 added this to the v0.1.1 milestone Jan 20, 2024

pearce8 linked an issue Jan 20, 2024 that may be closed by this pull request

benchpark tags #22

Closed

pearce8 and others added 6 commits February 21, 2024 16:14

Merging in develop

9b121b0

use benchpark tags command

3c0ac74

remove commented out code

4e89b82

remove unused parsing script for tags

b9fceaa

black

f6555d5

remove unused import

4733304

slabasan marked this pull request as ready for review February 22, 2024 06:23

pearce8 and others added 8 commits February 22, 2024 19:36

Merge branch 'develop' into feature/tagging

01974bd

Merge branch 'develop' into feature/tagging

5268517

Merge branch 'develop' into feature/tagging

914f161

Merge branch 'develop' into feature/tagging

fd598fb

Merge branch 'develop' into feature/tagging

51318c5

Merge branch 'develop' into feature/tagging

9413be1

Add CI setup to install ramble for use in docs generation

5e3b193

add tag table generator script to makefile

f773524

github-actions bot added the ci label (Involving Project CI & Unit Tests) Mar 13, 2024

run all

e2fb13d

alecbcs force-pushed the feature/tagging branch 2 times, most recently from e3c84c9 to 61ebff7 Compare March 13, 2024 21:52

Add yaml dependency to docs CI

0fc7013

alecbcs force-pushed the feature/tagging branch from 61ebff7 to 0fc7013 Compare March 13, 2024 21:55

slabasan and others added 3 commits March 13, 2024 14:59

fixes

44dd1a8

sphinxcontrib-programoutput req

77ff056

Fix codspell looking in the rendered docs

f1f28ca

github-actions bot added the dependencies label (Modifications to a Dependency File) Mar 13, 2024

slabasan added 2 commits March 13, 2024 15:42

remove tables/*.csv

1adeb7f

run benchmark tags script

b06808e

alecbcs approved these changes Mar 13, 2024

alecbcs merged commit f3cfc47 into develop Mar 13, 2024
7 checks passed

slabasan deleted the feature/tagging branch March 14, 2024 19:55
