
Tracking performance testing - collect and analyze results of benchmark runs #5464

Closed
thearusable opened this issue Sep 16, 2022 · 7 comments · May be fixed by kokkos/kokkos-benchmark-results#1

@thearusable (Contributor) commented Sep 16, 2022

Related to #5070, item 2.5 (Propose a mechanism to track performance over time):

  • have a repo inside the kokkos organization to store all the JSON files: let's call this the database-repo for now
  • push all the JSON files generated by the benchmarks there (somehow), for example every time a PR is merged into the main branch
  • have a script inside database-repo that automatically generates plots every time the files are updated

  1. When the performance tests run on CI, we can add an additional step that uploads all generated JSON files to database-repo (see the sketch after this list).
  2. After benchmark results are pushed to database-repo, a GitHub Action will run in database-repo; the sole purpose of that action is to run the analysis software that generates the required output data.
  3. To analyze the benchmark data we can use: https://github.com/bensanmorris/benchmark_monitor
     • By default this tool generates a chart of benchmark runs and an HTML index file based on a Jinja2 template. The template could be modified to include benchmark context data.
     • Update: By default the images will be saved using the following format, BenchmarkName-metric.png, so for example BM_SomeFunction-real_time.png.
  4. Generated files can be pushed to GitHub Pages using GitHub Actions, for example https://github.com/marketplace/actions/deploy-to-github-pages.
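
The upload step in item 1 could be a small script invoked from CI. A minimal sketch, assuming the results land in kokkos/kokkos-benchmark-results (the repo eventually used), that CI provides an access token, and that results are grouped by commit SHA; the layout and function name are illustrative, not a fixed design:

```python
# Hedged sketch of the CI upload step: clone the database-repo, copy in
# the Google Benchmark JSON files from this run, commit, and push.
# Assumes git is installed and a git identity is configured in CI.
import subprocess
from pathlib import Path

def upload_results(results_dir: Path, token: str, commit_sha: str) -> None:
    repo_url = (
        f"https://x-access-token:{token}"
        "@github.com/kokkos/kokkos-benchmark-results.git"
    )
    clone_dir = Path("/tmp/kokkos-benchmark-results")
    subprocess.run(["git", "clone", "--depth=1", repo_url, str(clone_dir)], check=True)

    # Group this run's JSON files under the Kokkos commit they were built from.
    dest = clone_dir / "results" / commit_sha
    dest.mkdir(parents=True, exist_ok=True)
    for json_file in results_dir.glob("*.json"):
        (dest / json_file.name).write_bytes(json_file.read_bytes())

    git = ["git", "-C", str(clone_dir)]
    subprocess.run(git + ["add", "results"], check=True)
    subprocess.run(git + ["commit", "-m", f"Add benchmark results for {commit_sha}"], check=True)
    subprocess.run(git + ["push"], check=True)
```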

Example chart: [image: BM_SomeFunction-real_time]


TODO:

  • I will create a quick demo in my own repositories
@thearusable self-assigned this Sep 16, 2022
@thearusable added the Feature Request label Sep 16, 2022
@thearusable changed the title from "Tracking performance testing - collect and analyze result of benchmark runs" to "Tracking performance testing - collect and analyze results of benchmark runs" Sep 16, 2022
@cz4rs (Contributor) commented Sep 16, 2022

Looks like it should "just work", thumbs up for using preexisting components!

@fnrizzi (Contributor) commented Sep 21, 2022

Thanks, looks like a good solution!
Some comments:

  • I think we should also find a way to save plots with some filename convention.
  • We need to figure out whether the proposed solution is OK in terms of permissions etc.; that is up to @crtrott @dalg24.

After they give the OK, we can try to build the prototype.

@thearusable (Contributor, Author) commented

By default the images will be saved using the following format, BenchmarkName-metric.png, so for example BM_SomeFunction-real_time.png.

If we want a different naming format, we will need to modify the benchmark_monitor.py script.
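
For illustration, the default naming amounts to the following; this is a hedged restatement of the behavior, not the actual benchmark_monitor.py code:

```python
# Illustration of the default BenchmarkName-metric.png naming scheme;
# this is where a different convention would have to be patched in.
def output_filename(benchmark_name: str, metric: str) -> str:
    return f"{benchmark_name}-{metric}.png"

assert output_filename("BM_SomeFunction", "real_time") == "BM_SomeFunction-real_time.png"
```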

@masterleinad (Contributor) commented

My biggest concern is that nowadays we are mostly interested in performance on GPU backends, and the machines we run CI on don't produce results that are good/consistent enough for performance regression testing.

@thearusable (Contributor, Author) commented

The GitHub Action on database-repo can be configured to regenerate charts after any commit to the main branch, so it can be triggered by a commit from CI or by a commit made manually.

We can create separate directories to store benchmark results from CI and from dedicated benchmark machines, as sketched below.
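
A minimal sketch of that routing, assuming a layout of results/ci/ and results/machines/<name>/ inside database-repo; the directory names are illustrative, not an agreed convention:

```python
# Hedged sketch: file CI results and dedicated-machine results into
# separate directories so they can be analyzed independently.
import shutil
from pathlib import Path

def store_result(json_file: Path, repo_root: Path, machine: str | None = None) -> Path:
    # CI results land in results/ci/; each dedicated benchmark machine
    # gets its own subdirectory under results/machines/.
    subdir = "ci" if machine is None else f"machines/{machine}"
    dest_dir = repo_root / "results" / subdir
    dest_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.copy2(json_file, dest_dir))
```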

@cz4rs self-assigned this Oct 26, 2022
@cz4rs (Contributor) commented Nov 30, 2022

Current status:

TODO:

  • use commit hashes to identify builds in graphs (after PR #5463, "Add git information to benchmark metadata" for #5348, gets merged; edit: merged recently, work in progress)
  • automatically use the metric that is prefixed with "FOM" (figure of merit); see the sketch after this list
    • potentially: ensure that all benchmarks contain such a metric on the Kokkos side
  • make it easier to search for specific results
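
A minimal sketch of the first two items, assuming Google Benchmark's JSON layout (a top-level "context" block plus a "benchmarks" list with user counters flattened into each entry); the context key GIT_COMMIT_HASH is an assumption, since the exact key added by #5463 may be named differently:

```python
# Hedged sketch: pull the commit hash from the benchmark context and
# select the figure-of-merit counters (names prefixed with "FOM").
import json
from pathlib import Path

def load_fom_metrics(path: Path) -> tuple[str, dict[str, dict[str, float]]]:
    data = json.loads(path.read_text())
    # Key name is an assumption; adjust to whatever #5463 actually emits.
    commit = data["context"].get("GIT_COMMIT_HASH", "unknown")

    # Google Benchmark flattens user counters into each benchmark entry,
    # so FOM counters show up as extra keys next to real_time/cpu_time.
    fom = {
        bench["name"]: {k: v for k, v in bench.items() if k.startswith("FOM")}
        for bench in data["benchmarks"]
    }
    return commit, fom
```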

@cz4rs added and then removed the Enhancement label Dec 7, 2022
@cz4rs (Contributor) commented May 24, 2023

Performance results are collected and stored in https://github.com/kokkos/kokkos-benchmark-results.

@cz4rs closed this as completed May 24, 2023