-
-
Notifications
You must be signed in to change notification settings - Fork 39
Issues: kyegomez/zeta
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG] No parameter named Something isn't working
gamma
in decoupled_optimizer.py
bug
#290
opened Oct 12, 2024 by
erlebach
[Question] Specifying the device to run on
bug
Something isn't working
#288
opened Oct 12, 2024 by
erlebach
Should I use linear layers for the input and output of FlashAttention?
bug
Something isn't working
no-issue-activity
#247
opened Jul 21, 2024 by
chenhengx0101
[BUG] Why is the backpropagation calculation so slow when I use the mamba network?
bug
Something isn't working
no-issue-activity
#220
opened May 26, 2024 by
1325116124
ProTip!
Follow long discussions with comments:>50.