Skip to content

Commit

Permalink
Added last updated date, fixed addition of 'eager'
Browse files Browse the repository at this point in the history
  • Loading branch information
MaanavD committed Nov 21, 2023
1 parent 0901514 commit 9d928af
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions src/routes/blogs/accelerating-llama-2/+page.svelte
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,7 @@
and
<a href="https://www.linkedin.com/in/parinitaparinita/" class="text-blue-500">Parinita Rahi</a>
</p>
<p class="text-neutral">14TH NOVEMBER, 2023</p>
<p class="text-neutral">14TH NOVEMBER, 2023 <span class="italic text-stone-500">(Updated 22nd November)</span></p>
<div class="py-4">
<p class="mb-4">
Interested in running Llama2 faster? Let us explore how ONNX Runtime can propel your Llama2
Expand Down Expand Up @@ -134,7 +134,7 @@
<p class="mb-4">
Token generation throughput below is the average throughput of the first 256 tokens generated.
We see up to ~1.4X (7B) and ~1.7X (13B) gains in token generation throughput when compared to
PyTorch eager and compile modes.
PyTorch compile mode.
</p>

<div class="grid grid-cols-1 lg:grid-cols-2 gap-4">
Expand Down

0 comments on commit 9d928af

Please sign in to comment.