Skip to content

Commit

Permalink
Update Mixtral perf figures
Browse files Browse the repository at this point in the history
Perf measurements taken with 3bb01f9
  • Loading branch information
yieldthought committed Jun 3, 2024
1 parent e08a919 commit 0fad284
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -56,7 +56,7 @@
| [LLaMA-2-70B-decode](./models/demos/t3000/llama2_70b) | Tensor Parallel | 129th | 32 | 8.5 t/s/u - 272 t/s | 13.9 t/s/u - 445 t/s | 20 t/s/u |
| [LLaMA-3-70B-decode](./models/demos/t3000/llama3_70b) | Tensor Parallel | 129th | 32 | 8.1 t/s/u - 257 t/s | 13.9 t/s/u - 445 t/s | 20 t/s/u |
| [Falcon40B-decode](./models/demos/t3000/falcon40b) | Tensor Parallel | 129th | 32 | 1.5 t/s/u - 48 t/s | 14.0 t/s/u - 448 t/s | 30 t/s/u |
| [Mixtral7Bx8-decode](./models/demos/t3000/mixtral8x7b) | Tensor Parallel | 129th | 32 | 3.6 t/s/u - 114 t/s | 23.5 t/s/u - 752 t/s | 28 t/s/u |
| [Mixtral7Bx8-decode](./models/demos/t3000/mixtral8x7b) | Tensor Parallel | 129th | 32 | 7.0 t/s/u - 225 t/s | 27.0 t/s/u - 864 t/s | 28 t/s/u |
| ResNet50 | Data Parallel | coming soon | | | | |

## Using TT-NN ops and tensors
Expand Down

0 comments on commit 0fad284

Please sign in to comment.