[On Hold] Upgrade nanoGPT #3974
Update the nanoGPT model with the following:
This replaces the old issue #2193.
Here is the commit for loading weights from the Weka path for the nanoGPT model: Link
We have now moved the weights to the Weka path and uplifted the model to use the supported TT ops.
This PR (#4221) has the following:
We see a drop in PCC (from 0.99 to 0.98) for the whole model when using tt_lib.tensor.softmax in the attention submodule.
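For readers unfamiliar with the metric, here is a minimal sketch of how a PCC comparison between a reference softmax and an approximated one might look. It assumes PCC means the Pearson correlation coefficient over the flattened tensors. The helpers `pcc` and `softmax` are hypothetical names, and NumPy with a float16 cast is only a stand-in for a lower-precision device op; it does not call tt_lib and does not reproduce the 0.99 or 0.98 figures.

```python
import numpy as np

def pcc(expected, actual):
    """Pearson correlation coefficient between two tensors, flattened.

    Hypothetical helper for illustration; not the project's own checker.
    """
    e = np.asarray(expected, dtype=np.float64).ravel()
    a = np.asarray(actual, dtype=np.float64).ravel()
    return float(np.corrcoef(e, a)[0, 1])

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(0)
# Assumed shape for illustration: (heads, query positions, key positions).
scores = rng.standard_normal((4, 8, 8)).astype(np.float32)

reference = softmax(scores)
# Assumption: a float16 round trip stands in for a lower-precision op.
approx = softmax(scores.astype(np.float16)).astype(np.float32)

print("PCC(reference, reference):", pcc(reference, reference))
print("PCC(reference, approx):   ", pcc(reference, approx))
```

Running the same comparison per submodule (attention, MLP, full block) is one way to narrow down where a whole-model PCC drop originates.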
The tasks
are linked to #4342
Can you make a PR for this?
Yes, you can wait.
PR #4221 merged. We can proceed with creating PRs for other commits.