[Bug Report] ttnn.mean op - Data Mismatch #13621
Comments
@ntarafdar @sjameelTT can you please help me find owners for this issue?
Hey @sdjordjevicTT, I'm asking around. It's a reduction op that doesn't have an owner; I will ask the ttnn people and get back to you.
@sdjordjevicTT I asked around, and since there is no other owner for this, the TMs team will have to take this.
Thanks @ntarafdar for picking this up. Great, I believe that should work for us.
Moving to a P1 issue. @sdjordjevicTT please comment if you believe the P0 is justified.
@nvukobratTT can comment more about priority, but I think this issue blocks Llama 3B bring-up on the Forge side.
Confirming what @sdjordjevicTT mentioned, this one is a blocker for the Open Llama 3B model. Additional details can be found on the MLIR issue as well:
Spoke to Jasmine, and @bbradelTT is taking over reductions for now. I'm reassigning this to him.
I tried to find out if there's any point at which there's a big drop off. Seemed like it might be somewhere between 1200 and 1400, but the PCC value goes up and down a fair amount:
Hi @bbradelTT, do we have any updates regarding this mismatch problem?
@sdjordjevicTT Unfortunately we need to overhaul reduce. I won't have concrete updates for a while.
@bbradelTT thanks for the details. Can you clarify the following:
To be certain that this issue is properly tracked, I'm re-adding the P0 label once again. Please correct me if I'm missing some context as to why this one should still be a P1 issue. Thanks!
Update for today:
Thanks for pushing this one further @bbradelTT! Much appreciated 🙌
Update for today:
Thanks for the update and for letting us know Borys :)) It's valuable for us to know the state of the issues, and when we expect it to be resolved, so that we can plan accordingly on our side as well. Thanks once again!
Describe the bug
ttnn.mean throws an assertion error because of a data mismatch between the PyTorch and TTNN outputs: the PCC drops to 0.72 when an input tensor of shape (1, 12, 3200) is passed to the ttnn.mean op with dim = -1.
For more context, here is the exact error message
To Reproduce
Run the following test:
Expected behavior
The data mismatch between PyTorch and TTNN output should be resolved.
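For readers unfamiliar with the metric, the PCC quoted above is the Pearson correlation coefficient between the reference output and the device output. The following is only a minimal sketch of that comparison in plain Python, not the test from this issue: the helper names `mean_last_dim` and `pcc`, the random input data, the noise level, and the 0.99 threshold are all assumptions made for illustration, and no ttnn or PyTorch call is involved.

```python
import math
import random


def mean_last_dim(rows):
    """Reference mean over the last dimension of a 2-D nested list."""
    return [math.fsum(row) / len(row) for row in rows]


def pcc(expected, actual):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(expected)
    mean_e = math.fsum(expected) / n
    mean_a = math.fsum(actual) / n
    cov = math.fsum((e - mean_e) * (a - mean_a) for e, a in zip(expected, actual))
    var_e = math.fsum((e - mean_e) ** 2 for e in expected)
    var_a = math.fsum((a - mean_a) ** 2 for a in actual)
    return cov / math.sqrt(var_e * var_a)


random.seed(0)
# Same logical shape as in the report, (1, 12, 3200), with the batch dim dropped.
data = [[random.uniform(-1.0, 1.0) for _ in range(3200)] for _ in range(12)]
reference = mean_last_dim(data)

# Hypothetical stand-in for a device result: the reference plus small noise.
candidate = [m + random.gauss(0.0, 1e-4) for m in reference]

score = pcc(reference, candidate)
# Assumed pass threshold, for illustration only.
print("PCC:", score, "pass" if score >= 0.99 else "fail")
```

With a check of this kind, a score of 0.72 means the device means track the reference only loosely across the 12 rows, which is why the comparison is reported as a data mismatch rather than a rounding difference.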