From 883801ff0f505eef8253d5ecfbe02ae348fc07ad Mon Sep 17 00:00:00 2001 From: Sophie Schoenmeyer <107952697+sophies927@users.noreply.github.com> Date: Tue, 28 May 2024 22:22:36 +0000 Subject: [PATCH 1/6] Add ACPT docs to ORT docs --- docs/ecosystem/acpt.md | 42 +++++++++++++++++++++++++++++++++++++++++ docs/ecosystem/index.md | 4 ++++ 2 files changed, 46 insertions(+) create mode 100644 docs/ecosystem/acpt.md diff --git a/docs/ecosystem/acpt.md b/docs/ecosystem/acpt.md new file mode 100644 index 0000000000000..e7ecee2cdd520 --- /dev/null +++ b/docs/ecosystem/acpt.md @@ -0,0 +1,42 @@ +--- +title: Azure Container for PyTorch (ACPT) +description: Learn more about Azure Container for PyTorch (ACPT) and how it utilizes ONNX Runtime +nav_order: 9 +redirect_from: /docs/tutorials/ecosystem/acpt +--- +# Azure Container for PyTorch (ACPT) +{: .no_toc } + +Azure Container for PyTorch (ACPT) is a lightweight, standalone environment that includes the components needed to effectively run optimized training for large models. It helps reduce preparation costs and shorten deployment time. ACPT can be used to quickly get started with various deep learning tasks with PyTorch on Azure. + +## Contents +{: .no_toc } + +* TOC placeholder +{:toc} + + +## Why should I use ACPT? +* **Flexibility:** Use as-is with preinstalled packages or build on top of the curated environment. +* **Ease of use:** All components are installed and validated against dozens of Microsoft workloads to reduce setup costs and accelerate time to value. +* **Efficiency:** Avoid unnecessary image builds and only have required dependencies that are accessible right in the image/container. +* **Optimized training framework:** Set up, develop, and accelerate PyTorch models on large workloads, and improve training and deployment success rate. +* **Up-to-date stack:** Access the latest compatible versions of Ubuntu, Python, PyTorch, CUDA/ROCm, etc. 
+* **Latest training optimization technologies:** Make use of ONNX Runtime, DeepSpeed, MSCCL, and more. + +## Supported configurations for Azure Container for PyTorch (ACPT) +The following configurations are supported in the Microsoft Container Registry (MCR): [ptca_image_list.md](https://aiinfra.visualstudio.com/PyTorch/_git/ptebic?path=/.azure_pipelines/pipeline_config/ptca_image_list.md&_a=preview). + +Most recent image releases (last updated 4/25/2024): + +| Environment Name | OS | GPU Version | Python Version | PyTorch Version | ORT-training Version | DeepSpeed Version | torch-ort Version | Nebula Version | +|-----------------------------|----------------|-------------|----------------|-----------------|---------------------|-------------------|-------------------|----------------| +| acpt-pytorch-2.2-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.2.2 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.1-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.1.2 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 2.0.1 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | +| acpt-pytorch-1.13-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 1.13.1 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | + +Other packages like fairscale, horovod, msccl, protobuf, pyspark, pytest, pytorch-lightning, tensorboard, NebulaML, torchvision, and torchmetrics are provided to support all training needs. + +## Support +Version updates for supported environments, including the base images they reference, are released every two weeks to address vulnerabilities no older than 30 days. Based on usage, some environments may be deprecated (hidden from the product but usable) to support more common machine learning scenarios. 
diff --git a/docs/ecosystem/index.md b/docs/ecosystem/index.md index 4b8b1509cc84b..12492ad1e42fa 100644 --- a/docs/ecosystem/index.md +++ b/docs/ecosystem/index.md @@ -16,6 +16,10 @@ ONNX Runtime functions as part of an ecosystem of tools and platforms to deliver {:toc} +## Azure Container for PyTorch (ACPT) +* [Azure Container for PyTorch (ACPT) docs](https://onnxruntime.ai/docs/ecosystem/acpt.html){:target="_blank"} +* [Azure Container for PyTorch (ACPT) - Azure Machine Learning](https://learn.microsoft.com/en-us/azure/machine-learning/resource-azure-container-for-pytorch?view=azureml-api-2){:target="_blank"} + ## Azure Machine Learning Services * [Azure Container Instance: BERT](https://github.com/microsoft/onnxruntime/tree/main/onnxruntime/python/tools/transformers/notebooks/Inference_Bert_with_OnnxRuntime_on_AzureML.ipynb){:target="_blank"} * [Azure Kubernetes Services: FER+](https://github.com/microsoft/onnxruntime/blob/main/docs/python/notebooks/onnx-inference-byoc-gpu-cpu-aks.ipynb){:target="_blank"} From 87a3c271aff9e93962494a026e4fc072b1ffa507 Mon Sep 17 00:00:00 2001 From: Sophie Schoenmeyer <107952697+sophies927@users.noreply.github.com> Date: Wed, 29 May 2024 19:08:03 +0000 Subject: [PATCH 2/6] Remove link + update version info --- docs/ecosystem/acpt.md | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/docs/ecosystem/acpt.md b/docs/ecosystem/acpt.md index e7ecee2cdd520..2e6bad7a76f06 100644 --- a/docs/ecosystem/acpt.md +++ b/docs/ecosystem/acpt.md @@ -25,16 +25,16 @@ Azure Container for PyTorch (ACPT) is a lightweight, standalone environment that * **Latest training optimization technologies:** Make use of ONNX Runtime, DeepSpeed, MSCCL, and more. 
## Supported configurations for Azure Container for PyTorch (ACPT) -The following configurations are supported in the Microsoft Container Registry (MCR): [ptca_image_list.md](https://aiinfra.visualstudio.com/PyTorch/_git/ptebic?path=/.azure_pipelines/pipeline_config/ptca_image_list.md&_a=preview). +The following configurations are supported in the Microsoft Container Registry (MCR): Most recent image releases (last updated 4/25/2024): | Environment Name | OS | GPU Version | Python Version | PyTorch Version | ORT-training Version | DeepSpeed Version | torch-ort Version | Nebula Version | |-----------------------------|----------------|-------------|----------------|-----------------|---------------------|-------------------|-------------------|----------------| -| acpt-pytorch-2.2-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.2.2 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | -| acpt-pytorch-2.1-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.1.2 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | -| acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 2.0.1 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | -| acpt-pytorch-1.13-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 1.13.1 | 1.17.3 | 0.13.1 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.2-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.2.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.1-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.1.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 2.0.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| acpt-pytorch-1.13-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 1.13.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | Other packages like fairscale, horovod, msccl, protobuf, pyspark, pytest, pytorch-lightning, tensorboard, NebulaML, torchvision, and torchmetrics are provided to support all training needs. 
From 70783fe1db733649011a40e2bb09c8bf7d6716ff Mon Sep 17 00:00:00 2001 From: Sophie Schoenmeyer <107952697+sophies927@users.noreply.github.com> Date: Wed, 29 May 2024 19:12:30 +0000 Subject: [PATCH 3/6] Correct spacing --- docs/ecosystem/acpt.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/docs/ecosystem/acpt.md b/docs/ecosystem/acpt.md index 2e6bad7a76f06..8e6a8677c9f54 100644 --- a/docs/ecosystem/acpt.md +++ b/docs/ecosystem/acpt.md @@ -31,10 +31,10 @@ Most recent image releases (last updated 4/25/2024): | Environment Name | OS | GPU Version | Python Version | PyTorch Version | ORT-training Version | DeepSpeed Version | torch-ort Version | Nebula Version | |-----------------------------|----------------|-------------|----------------|-----------------|---------------------|-------------------|-------------------|----------------| -| acpt-pytorch-2.2-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.2.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | -| acpt-pytorch-2.1-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.1.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | -| acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 2.0.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | -| acpt-pytorch-1.13-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 1.13.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.2-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.2.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.1-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.1.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 2.0.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| acpt-pytorch-1.13-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 1.13.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | Other packages like fairscale, horovod, msccl, protobuf, pyspark, pytest, pytorch-lightning, tensorboard, NebulaML, torchvision, and torchmetrics are provided to support all training needs. 
From 7de8a450033a7f42e6f254e977fffebfba7fc92e Mon Sep 17 00:00:00 2001 From: Sophie Schoenmeyer <107952697+sophies927@users.noreply.github.com> Date: Wed, 29 May 2024 19:14:49 +0000 Subject: [PATCH 4/6] Correct spacing --- docs/ecosystem/acpt.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/ecosystem/acpt.md b/docs/ecosystem/acpt.md index 8e6a8677c9f54..e123add64fda9 100644 --- a/docs/ecosystem/acpt.md +++ b/docs/ecosystem/acpt.md @@ -30,7 +30,7 @@ The following configurations are supported in the Microsoft Container Registry ( Most recent image releases (last updated 4/25/2024): | Environment Name | OS | GPU Version | Python Version | PyTorch Version | ORT-training Version | DeepSpeed Version | torch-ort Version | Nebula Version | -|-----------------------------|----------------|-------------|----------------|-----------------|---------------------|-------------------|-------------------|----------------| +|-----------------------------|----------------|-------------|----------------|-----------------|----------------------|-------------------|-------------------|----------------| | acpt-pytorch-2.2-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.2.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | | acpt-pytorch-2.1-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.1.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | | acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 2.0.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | From 742aad523478ab2aa484ae43fff56d0e31a7aed1 Mon Sep 17 00:00:00 2001 From: Sophie Schoenmeyer <107952697+sophies927@users.noreply.github.com> Date: Wed, 29 May 2024 21:11:46 +0000 Subject: [PATCH 5/6] Add updated list of images --- docs/ecosystem/acpt.md | 24 +++++++++++++++--------- docs/ecosystem/ptca_image_list.md | 14 ++++++++++++++ 2 files changed, 29 insertions(+), 9 deletions(-) create mode 100644 docs/ecosystem/ptca_image_list.md diff --git a/docs/ecosystem/acpt.md b/docs/ecosystem/acpt.md index e123add64fda9..3b79c2990b9a7 100644 --- 
a/docs/ecosystem/acpt.md +++ b/docs/ecosystem/acpt.md @@ -1,7 +1,7 @@ --- title: Azure Container for PyTorch (ACPT) description: Learn more about Azure Container for PyTorch (ACPT) and how it utilizes ONNX Runtime -nav_order: 9 +nav_order: 1 redirect_from: /docs/tutorials/ecosystem/acpt --- # Azure Container for PyTorch (ACPT) @@ -27,14 +27,20 @@ Azure Container for PyTorch (ACPT) is a lightweight, standalone environment that ## Supported configurations for Azure Container for PyTorch (ACPT) The following configurations are supported in the Microsoft Container Registry (MCR): -Most recent image releases (last updated 4/25/2024): - -| Environment Name | OS | GPU Version | Python Version | PyTorch Version | ORT-training Version | DeepSpeed Version | torch-ort Version | Nebula Version | -|-----------------------------|----------------|-------------|----------------|-----------------|----------------------|-------------------|-------------------|----------------| -| acpt-pytorch-2.2-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.2.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | -| acpt-pytorch-2.1-cuda12.1 | Ubuntu 20.04 | cu121 | 3.10 | 2.1.2 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | -| acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 2.0.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | -| acpt-pytorch-1.13-cuda11.7 | Ubuntu 20.04 | cu117 | 3.10 | 1.13.1 | 1.18.0 | 0.14.2 | 1.17.3 | 0.16.11 | +| OS | GPU Type | Python Version | PyTorch Version | ORT-training version | DeepSpeed version | torch-ort Version | Nebula Version | +| - | - | - | - | - | - | - | - | +|ubuntu2004|cu117|3.8|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu117|3.9|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu117|3.10|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu118|3.8|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu118|3.10|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu118|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu118|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| 
+|ubuntu2004|cu121|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu121|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu118|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu121|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13| +|ubuntu2004|cu121|3.8|2.1.2|1.18.0|0.14.2|1.17.0|0.16.13| Other packages like fairscale, horovod, msccl, protobuf, pyspark, pytest, pytorch-lightning, tensorboard, NebulaML, torchvision, and torchmetrics are provided to support all training needs. diff --git a/docs/ecosystem/ptca_image_list.md b/docs/ecosystem/ptca_image_list.md new file mode 100644 index 0000000000000..9b1f8dc702afa --- /dev/null +++ b/docs/ecosystem/ptca_image_list.md @@ -0,0 +1,14 @@ +| OS | GPU Type | Python Version | PyTorch Version | ORT-training version | DeepSpeed version | torch-ort Version | Nebula Version | Image Name | MCR Image Name | +| - | - | - | - | - | - | - | - | - | - | +|ubuntu2004|cu117|3.8|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu117-py38-torch1131|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py38-torch1131| +|ubuntu2004|cu117|3.9|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu117-py39-torch1131|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py39-torch1131| +|ubuntu2004|cu117|3.10|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu117-py310-torch1131|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py310-torch1131| +|ubuntu2004|cu118|3.8|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py38-torch201|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py38-torch201| +|ubuntu2004|cu118|3.10|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py310-torch201|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch201| 
+|ubuntu2004|cu118|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py38-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py38-torch222| +|ubuntu2004|cu118|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py310-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch222| +|ubuntu2004|cu121|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py38-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py38-torch222| +|ubuntu2004|cu121|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py310-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py310-torch222| +|ubuntu2004|cu118|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py310-torch230|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch230| +|ubuntu2004|cu121|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py310-torch230|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py310-torch230| +|ubuntu2004|cu121|3.8|2.1.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py38-torch212|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py38-torch212| From ef5408eed66270f887743eb62f988bf634f11432 Mon Sep 17 00:00:00 2001 From: Sophie Schoenmeyer <107952697+sophies927@users.noreply.github.com> Date: Thu, 30 May 2024 18:47:10 +0000 Subject: [PATCH 6/6] Update image list --- docs/ecosystem/acpt.md | 17 +---------------- docs/ecosystem/ptca_image_list.md | 28 ++++++++++++++-------------- 2 files changed, 15 insertions(+), 30 deletions(-) diff --git a/docs/ecosystem/acpt.md b/docs/ecosystem/acpt.md index 3b79c2990b9a7..0f71cf431f2d6 100644 --- a/docs/ecosystem/acpt.md +++ b/docs/ecosystem/acpt.md @@ -25,22 +25,7 @@ Azure Container for PyTorch 
(ACPT) is a lightweight, standalone environment that * **Latest training optimization technologies:** Make use of ONNX Runtime, DeepSpeed, MSCCL, and more. ## Supported configurations for Azure Container for PyTorch (ACPT) -The following configurations are supported in the Microsoft Container Registry (MCR): - -| OS | GPU Type | Python Version | PyTorch Version | ORT-training version | DeepSpeed version | torch-ort Version | Nebula Version | -| - | - | - | - | - | - | - | - | -|ubuntu2004|cu117|3.8|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu117|3.9|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu117|3.10|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu118|3.8|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu118|3.10|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu118|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu118|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu121|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu121|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu118|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu121|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13| -|ubuntu2004|cu121|3.8|2.1.2|1.18.0|0.14.2|1.17.0|0.16.13| +The following configurations are supported in the Microsoft Container Registry (MCR): [ptca_image_list.md](./ptca_image_list.md). Other packages like fairscale, horovod, msccl, protobuf, pyspark, pytest, pytorch-lightning, tensorboard, NebulaML, torchvision, and torchmetrics are provided to support all training needs. 
diff --git a/docs/ecosystem/ptca_image_list.md b/docs/ecosystem/ptca_image_list.md index 9b1f8dc702afa..bc6540e1870bc 100644 --- a/docs/ecosystem/ptca_image_list.md +++ b/docs/ecosystem/ptca_image_list.md @@ -1,14 +1,14 @@ -| OS | GPU Type | Python Version | PyTorch Version | ORT-training version | DeepSpeed version | torch-ort Version | Nebula Version | Image Name | MCR Image Name | -| - | - | - | - | - | - | - | - | - | - | -|ubuntu2004|cu117|3.8|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu117-py38-torch1131|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py38-torch1131| -|ubuntu2004|cu117|3.9|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu117-py39-torch1131|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py39-torch1131| -|ubuntu2004|cu117|3.10|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu117-py310-torch1131|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py310-torch1131| -|ubuntu2004|cu118|3.8|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py38-torch201|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py38-torch201| -|ubuntu2004|cu118|3.10|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py310-torch201|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch201| -|ubuntu2004|cu118|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py38-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py38-torch222| -|ubuntu2004|cu118|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py310-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch222| 
-|ubuntu2004|cu121|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py38-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py38-torch222| -|ubuntu2004|cu121|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py310-torch222|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py310-torch222| -|ubuntu2004|cu118|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu118-py310-torch230|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch230| -|ubuntu2004|cu121|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py310-torch230|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py310-torch230| -|ubuntu2004|cu121|3.8|2.1.2|1.18.0|0.14.2|1.17.0|0.16.13|ptebic.azurecr.io/public/aifx/acpt/stable-ubuntu2004-cu121-py38-torch212|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py38-torch212| +| OS | GPU Type | Python Version | PyTorch Version | ORT-training version | DeepSpeed version | torch-ort Version | Nebula Version | MCR Image Name | +| - | - | - | - | - | - | - | - | - | +|ubuntu2004|cu117|3.8|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py38-torch1131| +|ubuntu2004|cu117|3.9|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py39-torch1131| +|ubuntu2004|cu117|3.10|1.13.1|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu117-py310-torch1131| +|ubuntu2004|cu118|3.8|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py38-torch201| +|ubuntu2004|cu118|3.10|2.0.1|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch201| +|ubuntu2004|cu118|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py38-torch222| 
+|ubuntu2004|cu118|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch222| +|ubuntu2004|cu121|3.8|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py38-torch222| +|ubuntu2004|cu121|3.10|2.2.2|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py310-torch222| +|ubuntu2004|cu118|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu118-py310-torch230| +|ubuntu2004|cu121|3.10|2.3.0|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py310-torch230| +|ubuntu2004|cu121|3.8|2.1.2|1.18.0|0.14.2|1.17.0|0.16.13|mcr.microsoft.com/aifx/acpt/stable-ubuntu2004-cu121-py38-torch212|