Opt LN_sharded and SMX_sharded #4147

yugaoTT · 2023-12-02T19:28:20Z

For SMX, remove wait_front to avoid unpacker-packer sync.
For LN, finish the compute for one block in a single run before switching to another compute section.

farbabi · 2023-12-05T19:01:30Z

models/demos/metal_BERT_large_11/tests/test_perf_bert11.py

@@ -171,7 +171,7 @@ def test_perf_virtual_machine(
 @pytest.mark.models_performance_bare_metal
 @pytest.mark.parametrize(
    "expected_inference_time, expected_compile_time, inference_iterations",
-    ([0.0375, 10, 10],),
+    ([0.0364, 10, 10],),


To tighten the inference time tolerance, please run the model 3 times on CI, average the inference times, and allow a 15% tolerance.

yugaoTT · 2023-12-05

Thanks for the advice. I ran it 3 times and it passed at ~338 fps.

yugaoTT · 2023-12-06

Hey @farbabi, if nothing else needs to be changed or tested, could I get an approval? Thanks.
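
For reference, the averaging suggestion in this review thread can be sketched roughly as below. This is only an illustration under assumptions: the helper name `expected_inference_time`, the sample timings, and the reading of "15% tolerance" as a 1.15x multiplier on the mean are all hypothetical and are not taken from the PR or the test harness.

```python
from statistics import mean


def expected_inference_time(measured_times_s, tolerance=0.15):
    """Average measured inference times and add a relative tolerance margin.

    Hypothetical helper: assumes the tolerance is applied as
    mean * (1 + tolerance), which the review comment does not spell out.
    """
    if not measured_times_s:
        raise ValueError("need at least one measurement")
    return mean(measured_times_s) * (1.0 + tolerance)


# Made-up timings (seconds) standing in for three CI runs; not from the PR.
runs = [0.030, 0.032, 0.031]
threshold = expected_inference_time(runs)
print(f"expected_inference_time = {threshold:.4f} s")
```

The resulting value would then go into the first slot of the `expected_inference_time, expected_compile_time, inference_iterations` parametrization shown in the diff.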

yugaoTT · 2023-12-06T16:58:20Z

@boris-drazic just a reminder that the PR needs your approval :). I'd appreciate it if you could take a look at the changes.

yugaoTT requested a review from tt-aho December 2, 2023 19:28

yugaoTT requested review from boris-drazic and farbabi as code owners December 2, 2023 19:28

yugaoTT self-assigned this Dec 4, 2023

yugaoTT added bert P1_feature_needed metal tt-metal issue labels Dec 4, 2023

yugaoTT temporarily deployed to dev December 4, 2023 18:19 — with GitHub Actions Inactive

farbabi suggested changes Dec 5, 2023

yugaoTT temporarily deployed to dev December 5, 2023 19:40 — with GitHub Actions Inactive

yugaoTT temporarily deployed to dev December 5, 2023 20:16 — with GitHub Actions Inactive

yugaoTT temporarily deployed to dev December 5, 2023 20:53 — with GitHub Actions Inactive

boris-drazic approved these changes Dec 6, 2023

yugaoTT had a problem deploying to dev December 6, 2023 23:28 — with GitHub Actions Failure

yugaoTT temporarily deployed to dev December 6, 2023 23:28 — with GitHub Actions Inactive

farbabi approved these changes Dec 7, 2023

yugaoTT force-pushed the BERT_large_sharded branch from 65447a0 to 3eb0ec7 Compare December 7, 2023 20:55

yugaoTT temporarily deployed to dev December 7, 2023 20:56 — with GitHub Actions Inactive

yugaoTT temporarily deployed to production December 7, 2023 21:34 — with GitHub Actions Inactive

yugaoTT added 5 commits December 7, 2023 18:32

#0: rebase to main

a2d4f57

#3629: fix bug in LN, bert pcc back to normal

66657dd

#3629: remove redundant wait_front

bfb5b4e

#0: remove extra wait_front

aa374ea

#0: set target fps to 330

cbe97c2

yugaoTT force-pushed the BERT_large_sharded branch from 3eb0ec7 to cbe97c2 Compare December 7, 2023 23:32

yugaoTT merged commit 5aeccad into main Dec 7, 2023
3 checks passed

tt-aho deleted the BERT_large_sharded branch June 25, 2024 20:21
