Add [pinned, unpinned] matrix to ci_eval.yaml. #767

Draft: ScottTodd wants to merge 2 commits into main from users/scotttodd/requirements-matrix

Conversation

ScottTodd
Member

Progress on #760. Depends on #765.

This changes ci_eval.yaml (sharktank perplexity tests for llama) from testing:

  1. iree perplexity (unpinned iree)
  2. torch perplexity (unpinned iree-turbine)

to testing:

  1. iree perplexity (unpinned iree)
  2. iree perplexity (pinned iree)
  3. torch perplexity (unpinned iree-turbine)

The pinned versions are shared across the project. We could individually update pins per test workflow or package, but I'd much rather keep the pins shared by default and leave fragmenting as an escape hatch as needed. I can update the other scheduled workflows to match this style if the changes here look good.

Triggered a test run here: https://github.com/nod-ai/shark-ai/actions/runs/12642858744.
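For illustration, a minimal sketch of what the matrix change might look like. The job name, runner label, and install commands below are assumptions for readability, not the actual workflow contents; `requirements-iree-pinned.txt` is the pin file discussed in review below.

```yaml
# Hypothetical sketch only; not the actual contents of ci_eval.yaml.
jobs:
  test_perplexity_iree:
    runs-on: ubuntu-latest  # illustrative; the real job likely targets a GPU runner
    strategy:
      fail-fast: false
      matrix:
        version: [pinned, unpinned]
    steps:
      - name: Install pinned IREE packages
        if: matrix.version == 'pinned'
        run: pip install -r requirements-iree-pinned.txt
      - name: Install unpinned (nightly) IREE packages
        if: matrix.version == 'unpinned'
        run: |
          pip install --upgrade --pre \
            -f https://iree.dev/pip-release-links.html \
            iree-base-compiler iree-base-runtime
```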

@archana-ramalingam
Collaborator

archana-ramalingam commented Jan 7, 2025


The ci_eval.yaml perplexity test is intended to run on a larger number of prompts with all attention variants of all Llama models, resulting in up to 6 tests in the future. Depending on the model size, each may take anywhere from 1 to 12 hours to complete. Given that this is scheduled after the IREE nightly release at 3 AM PST, it might block dev resources in the morning. We might need to run these on dedicated CI machines if we want to choose the matrix route.
We could do unpinned for schedule and pinned for pull_request/push.
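A hedged sketch of that split, using GitHub's documented per-key `fromJSON` matrix pattern (the job layout and triggers here are illustrative, not the workflow's actual configuration):

```yaml
# Hypothetical sketch of selecting matrix entries by trigger event.
jobs:
  test_perplexity_iree:
    strategy:
      matrix:
        version: ${{ github.event_name == 'schedule' && fromJSON('["unpinned"]') || fromJSON('["pinned"]') }}
```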

@ScottTodd force-pushed the users/scotttodd/requirements-matrix branch from a0f261c to fa7ab97 on January 7, 2025 at 17:14
@ScottTodd
Member Author

If we need more runners, I think we have plenty of extra machines that we can plug in. At this point, with multiple workflows unstable across unpinned versions, my main priority is getting back to a state where we know what works on which versions.

When we have more tests that take longer, we can

  • add more hardware runners to take jobs
  • shard tests across runners (see the sketch after this comment)
  • make the tests faster

This matrix approach is a bit inflexible (all variants run whenever the workflow is triggered, individual jobs can't be run in isolation, canceling the job cancels all variants, etc.), but it is simple to understand and easy to add to more workflows.
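On the sharding point, one possible approach is a shard matrix plus the pytest-shard plugin. This is a sketch under assumptions, not what the project necessarily uses; the test path and shard count are illustrative.

```yaml
# Hypothetical sharding sketch; spreads the test suite across 4 runners.
strategy:
  matrix:
    shard: [0, 1, 2, 3]
steps:
  - name: Run sharded perplexity tests
    run: |
      pytest sharktank/tests/evaluate \
        --num-shards=4 --shard-id=${{ matrix.shard }}
```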

@marbre (Collaborator) left a comment:


Not sure about where to run those / the runner resources, but overall this looks good to me.

Review thread on requirements-iree-pinned.txt (outdated, resolved).
ScottTodd added a commit that referenced this pull request Jan 13, 2025
Progress on #760. We could make
the scheduled jobs test both pinned and unpinned versions like on
#767.

Cleanup included here:

* Dropped the "Installing the PyTorch CPU wheels saves multiple minutes
and a lot of bandwidth on runner setup." comments since they are
repetitive. Could add them back if people find them useful.
* Stopped installing from the root `requirements.txt` in some workflows,
  instead opting to just install from the more specific
  `sharktank/requirements-tests.txt`.

I did not test the changes to scheduled workflows. Could do that on
request, or just revert if we see issues.
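For illustration, a hedged before/after of the install-step change this commit describes. The "before" step is an assumption about the prior workflow contents, not a quote from it.

```yaml
# Before (assumed): pulled in the full root requirements.
- name: Install requirements
  run: pip install -r requirements.txt -r sharktank/requirements-tests.txt
# After: only the test-specific requirements.
- name: Install requirements
  run: pip install -r sharktank/requirements-tests.txt
```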