From 736ed7aeb7ed0eaaec20bc4bc19e599f1606c654 Mon Sep 17 00:00:00 2001 From: Tianlei Wu Date: Thu, 13 Jun 2024 15:25:44 -0700 Subject: [PATCH 1/2] [Doc] Fix links in Device Tensor Doc (#21039) --- docs/performance/device-tensor.md | 7 +++---- 1 file changed, 3 insertions(+), 4 deletions(-) diff --git a/docs/performance/device-tensor.md b/docs/performance/device-tensor.md index 0ddcd8457f1ef..839258a047770 100644 --- a/docs/performance/device-tensor.md +++ b/docs/performance/device-tensor.md @@ -8,7 +8,7 @@ nav_order: 6 Using device tensors can be a crucial part in building efficient AI pipelines, especially on heterogenous memory systems. A typical example of such systems is any PC with a dedicated GPU. -While a [recent GPU](https://www.techpowerup.com/gpu-specs/geforce-rtx-4090.c3889) itself has a memory bandwidth of about 1TB/s, the interconnect [PCI 4.0 x16](https://de.wikipedia.org/wiki/PCI_Express) to the CPU can often be the limiting factor with only ~32GB/s. +While a [recent GPU](https://www.techpowerup.com/gpu-specs/geforce-rtx-4090.c3889) itself has a memory bandwidth of about 1TB/s, the interconnect [PCI 4.0 x16](https://en.wikipedia.org/wiki/PCI_Express) to the CPU can often be the limiting factor with only ~32GB/s. Therefore it is often best to keep data local to the GPU as much as possible or hide slow memory traffic behind computation as the GPU is able to execute compute and PCI memory traffic simultaneously. A typical use case for these scenarios where memory is already local to the inference device would be a GPU accelerated video processing of an encoded video stream which can be decoded with GPU decoders. @@ -20,7 +20,7 @@ Tile based inference for high resolution images is another use-case where custom ## CUDA CUDA in ONNX Runtime has two custom memory types. -`"CudaPinned"` and `"Cuda"` memory where [CUDA pinned](https://developer.nvidia.com/blog/how-optimize-data-transfers-cuda-cc/) is actually CPU memory which is directly accesible by the GPU allowing for fully asynchronous up and download of memory using [`cudaMemcpyAsync`](https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__MEMORY.html#group__CUDART__MEMORY_1g85073372f776b4c4d5f89f7124b7bf79). +`"CudaPinned"` and `"Cuda"` memory where [CUDA pinned](https://developer.nvidia.com/blog/how-optimize-data-transfers-cuda-cc/) is actually CPU memory which is directly accessible by the GPU allowing for fully asynchronous up and download of memory using [`cudaMemcpyAsync`](https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__MEMORY.html#group__CUDART__MEMORY_1g85073372f776b4c4d5f89f7124b7bf79). Normal CPU tensors only allow for a synchronous downloads from GPU to CPU while CPU to GPU copies can always be executed asynchronous. Allocating a tensor using the `Ort::Sessions`'s allocator is very straight forward using the [C++ API](https://onnxruntime.ai/docs/api/c/struct_ort_1_1_value.html#a5d35080239ae47cdbc9e505666dc32ec) which directly maps to the C API. @@ -51,7 +51,7 @@ auto ort_value = Ort::Value::CreateTensor( These allocated tensors can then be used as [I/O Binding](../performance/tune-performance/iobinding.md) to eliminate copy ops on the network and move the responsibility to the user. 
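For illustration (not part of the patched document), here is a minimal Python sketch of the same I/O-binding idea, assuming a CUDA-enabled onnxruntime build and a model with a single input named "input" and a single output named "output"; the model path, names, and shapes are placeholders.

```python
# Hedged sketch: keep input and output tensors resident on the GPU and bind
# them once, so run_with_iobinding() triggers no host/device copies per call.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx", providers=["CUDAExecutionProvider"])

# Allocate both tensors directly in CUDA device memory (device_id 0).
x_gpu = ort.OrtValue.ortvalue_from_numpy(
    np.zeros((1, 3, 224, 224), dtype=np.float32), "cuda", 0)
y_gpu = ort.OrtValue.ortvalue_from_shape_and_type((1, 1000), np.float32, "cuda", 0)

binding = session.io_binding()
binding.bind_ortvalue_input("input", x_gpu)
binding.bind_ortvalue_output("output", y_gpu)

# Per inference call only the kernel launches happen; data stays on the device.
session.run_with_iobinding(binding)
result = y_gpu.numpy()  # explicit download, only when the host really needs it
```

The host is only involved when `numpy()` is called to download the result; everything else stays in device memory.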
With such IO bindings more performance tunings are possible: - due to the fixed tensor address, a CUDA graph can be captured to reduce CUDA launch latency on CPU -- due to either having fully asynchronous downloads to pinned memory or eliminating memory copies by using device local tensor, CUDA can run [fully asynchronous via a run option](../execution-providers/CUDA-ExecutionProvider.md#performance-Tuning) on its given stream +- due to either having fully asynchronous downloads to pinned memory or eliminating memory copies by using device local tensor, CUDA can run [fully asynchronous via a run option](../execution-providers/CUDA-ExecutionProvider.md#performance-tuning) on its given stream. To set the custom compute stream for CUDA, refer to the V2 option API exposing the `Ort[CUDA|TensorRT]ProviderOptionsV2*`opaque struct pointer and the function `Update[CUDA|TensorRT]ProviderOptionsWithValue(options, "user_compute_stream", cuda_stream);` to set it's stream member. More details can be found in each execution provider doc. @@ -132,5 +132,4 @@ binding.bind_output("out", "dml") # binding.bind_ortvalue_output("out", dml_array_out) session.run_with_iobinding(binding) - ``` \ No newline at end of file From 82bb41a9f63f0ad50b481cdeb13c4212828ce659 Mon Sep 17 00:00:00 2001 From: Maanav Dalal Date: Thu, 13 Jun 2024 16:01:09 -0700 Subject: [PATCH 2/2] Fixed many accessibility issues. (#20990) ### Description Resolving or re-resolving: #20096 , #20118 , #20153 , #20151 , #20152 Done by: - Adding skip to main content links - Adding pause carousel - Fixing H1s - Fixing color scheme - Making code blocks appear on mobile / smaller viewports. ### For testing Please test using mobile and desktop versions, ensuring everything (especially the landing page) appear as expected! --- .../InfiniteMovingCards.svelte | 17 ++++++++- src/routes/+layout.svelte | 2 +- .../blogs/accelerating-llama-2/+page.svelte | 38 +++++++++---------- src/routes/blogs/blog-post-featured.svelte | 2 +- src/routes/blogs/blog-post.svelte | 2 +- src/routes/blogs/post.svelte | 2 +- .../blogs/pytorch-on-the-edge/+page.svelte | 30 +++++++-------- src/routes/components/code-blocks.svelte | 38 ++++++++++++------- src/routes/components/customers.svelte | 2 +- src/routes/components/footer.svelte | 4 +- src/routes/components/header.svelte | 4 +- src/routes/components/hero.svelte | 2 +- src/routes/components/performance.svelte | 2 +- .../components/training-and-inference.svelte | 4 +- src/routes/components/winarm.svelte | 18 ++++----- src/routes/events/event-post.svelte | 2 +- src/routes/getting-started/+page.svelte | 6 +-- src/routes/huggingface/+page.svelte | 30 +++++++-------- src/routes/inference/+page.svelte | 2 +- src/routes/onnx/+page.svelte | 2 +- .../testimonials/testimonial-card.svelte | 2 +- src/routes/training/+page.svelte | 14 +++---- src/routes/windows/+page.svelte | 6 +-- 23 files changed, 130 insertions(+), 101 deletions(-) diff --git a/src/lib/components/ui/InfiniteMovingCards/InfiniteMovingCards.svelte b/src/lib/components/ui/InfiniteMovingCards/InfiniteMovingCards.svelte index ddb822eb9454c..6c56315258661 100644 --- a/src/lib/components/ui/InfiniteMovingCards/InfiniteMovingCards.svelte +++ b/src/lib/components/ui/InfiniteMovingCards/InfiniteMovingCards.svelte @@ -57,14 +57,29 @@ } } }; + + const toggleScroll = () => { + if (scrollerRef) { + const currentState = window.getComputedStyle(scrollerRef).animationPlayState; + scrollerRef.style.animationPlayState = currentState === 'running' ? 
'paused' : 'running'; + } + }; + + const handleKeyDown = (event: { key: string; preventDefault: () => void; }) => { + if (event.key === 'Enter' || event.key === ' ') { + event.preventDefault(); // Prevent default spacebar scrolling behavior + toggleScroll(); + } + };
+
    diff --git a/src/routes/+layout.svelte b/src/routes/+layout.svelte index 12545e5526ad9..c6c538bc4e536 100644 --- a/src/routes/+layout.svelte +++ b/src/routes/+layout.svelte @@ -46,7 +46,7 @@
    {/if} {#key data.pathname} -
    +
    {/key} diff --git a/src/routes/blogs/accelerating-llama-2/+page.svelte b/src/routes/blogs/accelerating-llama-2/+page.svelte index c16eeeb5dcafb..5854bfcb489e8 100644 --- a/src/routes/blogs/accelerating-llama-2/+page.svelte +++ b/src/routes/blogs/accelerating-llama-2/+page.svelte @@ -45,11 +45,11 @@

    Accelerating LLaMA-2 Inference with ONNX Runtime

    - By: Kunal Vaishnavi and - Parinita Rahi + Parinita Rahi

    14TH NOVEMBER, 2023 (Updated 22nd November) @@ -70,13 +70,13 @@ quantization updates, and cross-platform usage scenarios.

    -

    Background: Llama2 and Microsoft

    +

    Background: Llama2 and Microsoft

    Llama2 is a state-of-the-art open source LLM from Meta ranging in scale from 7B to 70B parameters (7B, 13B, 70B). Microsoft and Meta announced their AI on Azure and Windows collaboration in July 2023. As part of the announcement, Llama2 was added to the Azure AI model catalog, which serves as a hub of foundation models that empower developers and machine learning (ML) professionals to easily discover, evaluate, customize, and @@ -89,7 +89,7 @@ your costs.

    -

    +

    Faster Inferencing with New ONNX Runtime Optimizations

    @@ -115,7 +115,7 @@
    Figure 1: E2E Throughput Comparisons
    -

    Latency and Throughput

    +

    Latency and Throughput

    The graphs below show latency comparisons between the ONNX Runtime and PyTorch variants of the @@ -152,11 +152,11 @@

    More details on these metrics can be found here.

    -

    ONNX Runtime with Multi-GPU Inference

    +

    ONNX Runtime with Multi-GPU Inference

    ONNX Runtime supports multi-GPU inference to enable serving large models. Even in FP16 @@ -165,7 +165,7 @@

    - ONNX Runtime applied Megatron-LM Tensor Parallelism on the 70B model to split the original model weight onto different GPUs. Megatron @@ -176,7 +176,7 @@ You can find additional example scripts here.
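As a rough illustration of the idea (this is not ONNX Runtime's or Megatron-LM's actual code), a single linear layer can be split column-wise across devices and the partial results gathered back together:

```python
# Illustrative sketch of column-parallel tensor parallelism with plain numpy;
# real implementations shard across GPUs and use an NCCL all-gather instead.
import numpy as np

num_ranks = 4
x = np.random.randn(2, 4096).astype(np.float32)      # activations, replicated on every rank
w = np.random.randn(4096, 11008).astype(np.float32)  # full weight, too big for one device

shards = np.split(w, num_ranks, axis=1)               # each rank stores one column slice
partials = [x @ w_shard for w_shard in shards]        # each rank computes its slice locally
y = np.concatenate(partials, axis=1)                  # "all-gather" restores the full output

assert np.allclose(y, x @ w, atol=1e-3)
```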

    @@ -185,7 +185,7 @@
    Figure 4: 70B Llama2 Model Throughput
    -

    ONNX Runtime Optimizations

    +

    ONNX Runtime Optimizations

    LLaMA-2 Optimization Diagram
    Figure 5: LLaMA-2 Optimization Diagram
    @@ -252,7 +252,7 @@ calculate the rotary embeddings more efficiently with less memory usage. The rotary embedding compute kernels also support interleaved and non-interleaved formats to support both the Microsoft version of LLaMA-2 and the Hugging Face version of LLaMA-2 respectively while sharing the same calculations.
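For intuition only, a small numpy sketch of the two rotary-embedding layouts (the fused ONNX Runtime kernels are implemented differently; the head dimension and position below are arbitrary):

```python
# Illustrative rotary position embedding (RoPE) for one head of dimension d,
# supporting both the interleaved and the non-interleaved pairing of channels.
import numpy as np

def rope(x, pos, interleaved, base=10000.0):
    d = x.shape[-1]
    inv_freq = base ** (-np.arange(0, d, 2) / d)   # one rotation frequency per pair
    angles = pos * inv_freq
    cos, sin = np.cos(angles), np.sin(angles)
    if interleaved:                                # pairs are (x0, x1), (x2, x3), ...
        x1, x2 = x[..., 0::2], x[..., 1::2]
    else:                                          # pairs are (x_i, x_{i + d/2})
        x1, x2 = x[..., : d // 2], x[..., d // 2:]
    r1 = x1 * cos - x2 * sin                       # same 2D rotation in both layouts
    r2 = x1 * sin + x2 * cos
    if interleaved:
        out = np.empty_like(x)
        out[..., 0::2], out[..., 1::2] = r1, r2
        return out
    return np.concatenate([r1, r2], axis=-1)

q = np.random.randn(128).astype(np.float32)        # one head, dimension 128
print(rope(q, pos=5, interleaved=True)[:4])
print(rope(q, pos=5, interleaved=False)[:4])
```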

    @@ -260,16 +260,16 @@

    The optimizations work for the Hugging Face versions (models ending with -hf) and the Microsoft versions. You can download the optimized HF versions from - Microsoft's LLaMA-2 ONNX repository. Stay tuned for newer Microsoft versions coming soon!

    -

    Optimize your own model using Olive

    +

    Optimize your own model using Olive

    Olive is a hardware-aware model optimization tool that incorporates advanced techniques such @@ -281,7 +281,7 @@

    Here is an example of Llama2 optimization with Olive, which harnesses ONNX Runtime optimizations highlighted in this blog. Distinct optimization flows cater to various requirements. For instance, you have the flexibility to choose different data types for quantization in CPU and GPU inference, based on your accuracy @@ -289,17 +289,17 @@ GPUs and perform inference with ONNX Runtime optimizations.

    -

    Usage Example

    +

    Usage Example

    Here is a sample notebook that shows you an end-to-end example of how you can use the above ONNX Runtime optimizations in your application.

    -

    Conclusion

    +

    Conclusion

    The advancements discussed in this blog provide faster Llama2 inferencing with ONNX Runtime, diff --git a/src/routes/blogs/blog-post-featured.svelte b/src/routes/blogs/blog-post-featured.svelte index b514fe4b29c9d..15d82a5164860 100644 --- a/src/routes/blogs/blog-post-featured.svelte +++ b/src/routes/blogs/blog-post-featured.svelte @@ -33,7 +33,7 @@

    {title}

    {description}

    {imgalt} -
    +
    {date}
    diff --git a/src/routes/blogs/blog-post.svelte b/src/routes/blogs/blog-post.svelte index a661253f59672..dfc303ae6cb1e 100644 --- a/src/routes/blogs/blog-post.svelte +++ b/src/routes/blogs/blog-post.svelte @@ -30,7 +30,7 @@

    {title}

    {description}

    -

    +

    {date}

    diff --git a/src/routes/blogs/post.svelte b/src/routes/blogs/post.svelte index 1b024eb5b2e40..73f248e7977c4 100644 --- a/src/routes/blogs/post.svelte +++ b/src/routes/blogs/post.svelte @@ -82,7 +82,7 @@

    By:

    {/if} {#each authors as author, i} - {author}{i + 1 === authors.length + {author}{i + 1 === authors.length ? '' : ', '} {/each} diff --git a/src/routes/blogs/pytorch-on-the-edge/+page.svelte b/src/routes/blogs/pytorch-on-the-edge/+page.svelte index 6d7f950f513a6..83ab6d2d49db6 100644 --- a/src/routes/blogs/pytorch-on-the-edge/+page.svelte +++ b/src/routes/blogs/pytorch-on-the-edge/+page.svelte @@ -179,9 +179,9 @@ fun run(audioTensor: OnnxTensor): Result {

    Run PyTorch models on the edge

    - By: Natalie Kershaw + By: Natalie Kershaw and - Prasanth Pulavarthi

    @@ -217,12 +217,12 @@ fun run(audioTensor: OnnxTensor): Result { anywhere that is outside of the cloud, ranging from large, well-resourced personal computers to small footprint devices such as mobile phones. This has been a challenging task to accomplish in the past, but new advances in model optimization and software like - ONNX Runtime + ONNX Runtime make it more feasible - even for new generative AI and large language models like Stable Diffusion, Whisper, and Llama2.

    -

    Considerations for PyTorch models on the edge

    +

    Considerations for PyTorch models on the edge

    There are several factors to keep in mind when thinking about running a PyTorch model on the @@ -292,7 +292,7 @@ fun run(audioTensor: OnnxTensor): Result {

-

Tools for PyTorch models on the edge

+

Tools for PyTorch models on the edge

We mentioned ONNX Runtime several times above. ONNX Runtime is a compact, standards-based @@ -305,7 +305,7 @@ format that doesn't require the PyTorch framework and its gigabytes of dependencies. PyTorch has thought about this and includes an API that enables exactly this - torch.onnx. ONNX is an open standard that defines the operators that make up models. The PyTorch ONNX APIs take the Pythonic PyTorch code and turn it into a functional graph that captures the operators that are needed to run the model without Python. As with everything @@ -318,7 +318,7 @@ The popular Hugging Face library also has APIs that build on top of this torch.onnx functionality to export models to the ONNX format. Over 130,000 models are supported making it very likely that the model you care about is one of them.
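As a minimal sketch of that export path (the module, file name, and tensor names below are stand-ins, not taken from the original post):

```python
# Export a small PyTorch module to ONNX so it can run without the PyTorch runtime.
import torch

class TinyClassifier(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(32, 64), torch.nn.ReLU(), torch.nn.Linear(64, 10))

    def forward(self, x):
        return self.net(x)

model = TinyClassifier().eval()
example_input = torch.randn(1, 32)

torch.onnx.export(
    model,
    example_input,
    "tiny_classifier.onnx",
    input_names=["input"],
    output_names=["logits"],
    dynamic_axes={"input": {0: "batch"}, "logits": {0: "batch"}},  # variable batch size
)
```

The exported file can then be loaded with onnxruntime's InferenceSession on the target device, with no PyTorch dependency.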

@@ -328,7 +328,7 @@ fun run(audioTensor: OnnxTensor): Result { and web browsers) via various languages (from C# to JavaScript to Swift).

-

Examples of PyTorch models on the edge

+

Examples of PyTorch models on the edge

Stable Diffusion on Windows

@@ -345,7 +345,7 @@ fun run(audioTensor: OnnxTensor): Result {

You don't have to export the fifth model, ClipTokenizer, as it is available in ONNX Runtime extensions, a library for pre- and post-processing PyTorch models.

@@ -366,7 +366,7 @@ fun run(audioTensor: OnnxTensor): Result {

You can build the application and run it on Windows with the detailed steps shown in this tutorial.

@@ -374,7 +374,7 @@ fun run(audioTensor: OnnxTensor): Result {

Running a PyTorch model locally in the browser is not only possible but super simple with - the transformers.js library. Transformers.js uses ONNX Runtime Web as its backend. Many models are already converted to ONNX and served by the transformers.js CDN, making inference in the browser a matter of writing @@ -407,7 +407,7 @@ fun run(audioTensor: OnnxTensor): Result { All components of the Whisper Tiny model (audio decoder, encoder, decoder, and text sequence generation) can be composed and exported to a single ONNX model using the Olive framework. To run this model as part of a mobile application, you can use ONNX Runtime Mobile, which supports Android, iOS, react-native, and MAUI/Xamarin.

@@ -420,7 +420,7 @@ fun run(audioTensor: OnnxTensor): Result {

The relevant snippet of an example Android mobile app that performs speech transcription on short samples of audio is shown below:

@@ -476,11 +476,11 @@ fun run(audioTensor: OnnxTensor): Result {

You can read the full Speaker Verification tutorial, and build and run the application from source.

diff --git a/src/routes/components/code-blocks.svelte b/src/routes/components/code-blocks.svelte index 531d8b9a6ff75..fd51f4292e2dd 100644 --- a/src/routes/components/code-blocks.svelte +++ b/src/routes/components/code-blocks.svelte @@ -7,9 +7,11 @@ import cpp from 'svelte-highlight/languages/cpp'; import FaLink from 'svelte-icons/fa/FaLink.svelte'; import { blur, fade } from 'svelte/transition'; + import { d } from 'svelte-highlight/languages'; + import github from "svelte-highlight/styles/github"; let pythonCode = - 'import onnxruntime as ort\n# Load the model and create InferenceSession\nmodel_path = "path/to/your/onnx/model"\nsession = ort.InferenceSession(model_path)\n# Load and preprocess the input image inputTensor\n...\n# Run inference\noutputs = session.run(None {"input": inputTensor})\nprint(outputs)'; + 'import onnxruntime as ort\n# Load the model and create InferenceSession\nmodel_path = "path/to/your/onnx/model"\nsession = ort.InferenceSession(model_path)\n# "Load and preprocess the input image inputTensor"\n...\n# Run inference\noutputs = session.run(None {"input": inputTensor})\nprint(outputs)'; let csharpCode = 'using Microsoft.ML.OnnxRuntime;\n// Load the model and create InferenceSession\nstring model_path = "path/to/your/onnx/model";\nvar session = new InferenceSession(model_path);\n// Load and preprocess the input image to inputTensor\n...\n// Run inference\nvar outputs = session.Run(inputTensor).ToList();\nConsole.WriteLine(outputs[0].AsTensor()[0]);'; let javascriptCode = @@ -45,30 +47,40 @@ activeTab = tabText; activeTab = activeTab; }; - + let innerWidth = 0 + + + + {@html github} +

Use ONNX Runtime with your favorite language and get started with the tutorials:

-