
Updated images.
MaanavD committed Feb 27, 2024
1 parent cef1ea7 commit b612686
Showing 7 changed files with 3 additions and 3 deletions.
4 changes: 2 additions & 2 deletions src/routes/blogs/accelerating-phi-2/+page.svx
@@ -85,9 +85,9 @@ Here is an example of [Phi-2 optimizations with Olive](https://github.com/micros

## Training

- In addition to inference, ONNX Runtime also provides training speedups for Phi-2 and other LLMs. ORT Training is part of the PyTorch ecosystem and is available via the torch-ort Python package, as part of the Azure Container for PyTorch (ACPT). It provides flexible and extensible hardware support: the same model and APIs work with both NVIDIA and AMD GPUs. ORT accelerates training through optimized kernels and memory optimizations that significantly reduce end-to-end training time for large models. Enabling it requires changing only a few lines of code to wrap the model with the ORTModule API, and it composes with popular acceleration libraries such as DeepSpeed and Megatron for faster, more efficient training.
+ In addition to inference, ONNX Runtime also provides training speedups for Phi-2 and other LLMs. ORT Training is part of the PyTorch ecosystem and is available via the torch-ort Python package, as part of the [Azure Container for PyTorch (ACPT)](https://learn.microsoft.com/en-us/azure/machine-learning/resource-azure-container-for-pytorch?view=azureml-api-2). It provides flexible and extensible hardware support: the same model and APIs work with both NVIDIA and AMD GPUs. ORT accelerates training through optimized kernels and memory optimizations that significantly reduce end-to-end training time for large models. Enabling it requires changing only a few lines of code to wrap the model with the ORTModule API, and it composes with popular acceleration libraries such as DeepSpeed and Megatron for faster, more efficient training.
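The ORTModule wrap described in the paragraph above can be sketched as follows. This is a minimal illustration, assuming `torch` and `torch-ort` are installed on a supported GPU; the model and training-loop contents are placeholders chosen for the sketch, not taken from the blog post.

```python
# Minimal sketch: wrapping a PyTorch model with ORTModule (assumes `pip install torch-ort`).
import torch
from torch import nn
from torch_ort import ORTModule

# Placeholder model for illustration.
model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))
model = ORTModule(model)  # the one-line change: ORT now runs the forward/backward graphs

# The rest of the training loop is unchanged, standard PyTorch.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))

optimizer.zero_grad()
loss = loss_fn(model(x), y)
loss.backward()
optimizer.step()
```

Because the optimizer, loss, and data pipeline are untouched, the same script composes with DeepSpeed or Megatron as the post notes.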

- OpenAI's Triton is a domain-specific language and compiler for writing highly efficient custom deep learning primitives. ORT supports OpenAI Triton integration (ORT+Triton), where all element-wise operators are converted to Triton ops and ORT creates custom fused kernels in Triton.
+ [OpenAI's Triton](https://openai.com/research/triton) is a domain-specific language and compiler for writing highly efficient custom deep learning primitives. ORT supports OpenAI Triton integration (ORT+Triton), where all element-wise operators are converted to Triton ops and ORT creates custom fused kernels in Triton.
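To make "fused element-wise kernels in Triton" concrete, here is a generic hand-written Triton kernel fusing an add and a ReLU. This is an illustrative sketch of the style of kernel involved, not ORT's actual generated code; it assumes a CUDA GPU with the `triton` package installed, and the function names are this sketch's own.

```python
# Illustrative Triton kernel: fused element-wise add + ReLU.
# Generic sketch (not ORT-generated); requires a CUDA GPU and `pip install triton`.
import torch
import triton
import triton.language as tl

@triton.jit
def fused_add_relu_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements          # guard the final partial block
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    # Both element-wise ops happen in one kernel launch, with one round trip to memory.
    tl.store(out_ptr + offsets, tl.maximum(x + y, 0.0), mask=mask)

def fused_add_relu(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = x.numel()
    grid = (triton.cdiv(n, 1024),)
    fused_add_relu_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)
    return out

# Usage (on a CUDA device):
#   x = torch.randn(4096, device="cuda"); y = torch.randn(4096, device="cuda")
#   torch.testing.assert_close(fused_add_relu(x, y), torch.relu(x + y))
```

Fusing element-wise chains like this is what avoids materializing intermediate tensors between ops, which is the performance win ORT+Triton targets.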

ORT also performs sparsity optimization to assess input data sparsity and perform graph optimizations leveraging this sparsity. This reduces the compute FLOP requirements and increases performance.

Binary file modified src/routes/blogs/accelerating-phi-2/Llama2_Training.png
Binary file modified src/routes/blogs/accelerating-phi-2/Mistral_Training.png
Binary file modified src/routes/blogs/accelerating-phi-2/Orca2_Training.png
Binary file modified src/routes/blogs/accelerating-phi-2/Phi2_trainingTP.png
Binary file modified src/routes/blogs/accelerating-phi-2/Phi2_training_2a100.png
2 changes: 1 addition & 1 deletion src/routes/blogs/github-markdown-light.css
@@ -3,7 +3,7 @@ ul {
}

.w50{
-  width: 50em;
+  width: 35em;
}
/*light*/

