Upgrade HPU image to v1.16.2. #5610

Merged
Changes from 3 commits
19 commits
3881960
Upgrade HPU image to v1.16.0
vshekhawat-hlab Jun 4, 2024
66d1f1b
Merge branch 'master' into upgrade-hpu-and-test-coverage
vshekhawat-hlab Jun 5, 2024
3f90328
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jun 5, 2024
c52ec11
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jun 5, 2024
d5fe69e
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jun 12, 2024
fa522de
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jun 12, 2024
b3211b9
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jun 12, 2024
a71ce4a
Updated HPU image to 1.16.2.
vshekhawat-hlab Jun 26, 2024
a56c094
Merge branch 'master' into upgrade-hpu-and-test-coverage
vshekhawat-hlab Jun 26, 2024
f47840f
Fix typo on docker image name
loadams Jun 26, 2024
105c4b4
Typo part 2
loadams Jun 26, 2024
457ba74
Enabled dmesg logs and added hl-smi list command.
vshekhawat-hlab Jun 27, 2024
82b166a
Merge branch 'master' into upgrade-hpu-and-test-coverage
vshekhawat-hlab Jun 27, 2024
8b5061a
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jul 16, 2024
3ec2c43
Remove dmesg and duplicate hl-smi cmd from hpu-gaudi2 workflow.
vshekhawat-hlab Jul 17, 2024
8dc9078
Merge branch 'master' into upgrade-hpu-and-test-coverage
vshekhawat-hlab Jul 17, 2024
40102ba
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jul 25, 2024
2635ba2
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Jul 29, 2024
ae1bdf2
Merge branch 'master' into upgrade-hpu-and-test-coverage
loadams Aug 13, 2024
5 changes: 3 additions & 2 deletions .github/workflows/hpu-gaudi2.yml
@@ -39,13 +39,14 @@ jobs:
# The type of runner that the job will run on
runs-on: [self-hosted, intel, gaudi2]
container:
- image: vault.habana.ai/gaudi-docker/1.15.1/ubuntu22.04/habanalabs/pytorch-installer-2.2.0:latest
+ image: vault.habana.ai/gaudi-docker/1.16.0/ubuntu22.04/habanalabs/pytorch-installer-2.2.2:latest
ports:
- 80
options: --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none --cap-add=sys_nice

env:
PT_HPU_LAZY_MODE: 0
TORCHINDUCTOR_COMPILE_THREADS: 1
TEST_LIST: |
test_accelerator.py
test_autotuning.py
@@ -129,7 +130,7 @@ jobs:
unset TORCH_CUDA_ARCH_LIST # only jit compile for current arch
cd tests
export PT_HPU_LAZY_MODE=${PT_HPU_LAZY_MODE}
export TORCHINDUCTOR_COMPILE_THREADS=${TORCHINDUCTOR_COMPILE_THREADS}
TEST_LIST=$(echo "$TEST_LIST" | awk 'NF{printf "%s%s", (NR>1 ? " or " : ""), $0} END{if (NR>1) print ""}')
echo "TEST_LIST ${TEST_LIST}"
echo "PT_HPU_LAZY_MODE ${PT_HPU_LAZY_MODE}"
pytest --verbose unit/ -k "${TEST_LIST}"
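
The `awk` one-liner in the workflow turns the multi-line `TEST_LIST` into a single `-k` expression by joining the entries with ` or `. A minimal Python sketch of the same idea follows; the function name `build_k_expression` is hypothetical and not part of the workflow. Unlike the `awk` version, this sketch strips surrounding whitespace from each entry and never emits a leading ` or ` when the list starts with a blank line.

```python
def build_k_expression(test_list: str) -> str:
    """Join the non-blank lines of test_list with ' or '.

    Hypothetical helper: it approximates the awk command in the workflow,
    which skips empty lines (NF) and prefixes every record after the first
    with ' or '.
    """
    # Keep only lines that contain something other than whitespace.
    names = [line.strip() for line in test_list.splitlines() if line.strip()]
    return " or ".join(names)


if __name__ == "__main__":
    # Same shape as the YAML block scalar assigned to TEST_LIST.
    sample = "test_accelerator.py\ntest_autotuning.py\n"
    print(build_k_expression(sample))
    # prints: test_accelerator.py or test_autotuning.py
```

The resulting string is what the workflow passes to `pytest --verbose unit/ -k "${TEST_LIST}"`, so only tests matching one of the listed names are selected.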