[BUG] Why is the backpropagation calculation so slow when I use the mamba network? #220

1325116124 · 2024-05-26T17:46:40Z

When I used the Mamba network, I defined a loss to test backpropagation and found that the computation was very slow. With the sequence length set to 1024, there is a long wait. The code is shown below:

```python
import torch
import torch.nn as nn
from zeta.nn import MambaBlock

# Single Mamba block with model dimension 512
block = MambaBlock(dim=512, depth=1)
x = torch.randn(1, 1024, 512)  # (batch, sequence length, dim)
target = torch.randn(1, 1024, 512)
loss_fn = nn.MSELoss()

y = block(x)
loss = loss_fn(y, target)
loss.backward()  # backward pass reported as slow
print("Output shape:", y.shape)
print("Loss value:", loss.item())
```
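To narrow down where the time goes, it may help to time the forward and backward passes separately. The following is a minimal sketch using only the standard library; `time_stage` is a hypothetical helper name, not part of zeta or PyTorch, and the commented usage assumes the `block`, `x`, `target`, and `loss_fn` objects from the snippet above.

```python
import time
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def time_stage(fn: Callable[[], T]) -> Tuple[T, float]:
    """Run `fn` once and return (result, elapsed wall-clock seconds).

    Hypothetical helper for illustration only.
    """
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start


# Usage with the snippet above (assumes block, x, target, loss_fn exist):
#   y, t_fwd = time_stage(lambda: block(x))
#   loss = loss_fn(y, target)
#   _, t_bwd = time_stage(loss.backward)
#   print(f"forward {t_fwd:.3f}s, backward {t_bwd:.3f}s")
```

The snippet in this issue runs on the CPU, so wall-clock timing is meaningful as is. On a GPU, the timings would only be reliable if `torch.cuda.synchronize()` were called before reading the clock, since CUDA kernels launch asynchronously.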


github-actions · 2024-05-26T17:47:04Z

Hello there, thank you for opening an issue! 🙏🏻 The team has been notified and will get back to you ASAP.

kyegomez · 2024-06-13T02:39:45Z

@1325116124 It's using the Mamba scan (the SSM). It should be updated soon!
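For readers unfamiliar with the term, the scan in a state space model evaluates a linear recurrence of the form h_t = a_t * h_{t-1} + b_t along the sequence. The sketch below is a generic, pure-Python illustration and is not the code used in zeta; the function names are hypothetical. It contrasts a step-by-step scan, whose chain of dependent steps grows with sequence length, with a recursive-doubling scan that reaches the same result in about log2(L) rounds of independent updates.

```python
from typing import List, Tuple


def sequential_scan(a: List[float], b: List[float]) -> List[float]:
    """Compute h_t = a_t * h_{t-1} + b_t with h_{-1} = 0, one step at a time."""
    h = 0.0
    out = []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t  # each step depends on the previous one
        out.append(h)
    return out


def combine(left: Tuple[float, float], right: Tuple[float, float]) -> Tuple[float, float]:
    """Compose two affine steps h -> a*h + b, applying `left` first, then `right`."""
    a1, b1 = left
    a2, b2 = right
    return (a1 * a2, a2 * b1 + b2)


def doubling_scan(a: List[float], b: List[float]) -> List[float]:
    """Same recurrence via recursive doubling (Hillis-Steele inclusive scan).

    Within a round, every update reads only the previous round's values,
    so the inner loop could run in parallel on suitable hardware.
    """
    elems = list(zip(a, b))
    n = len(elems)
    step = 1
    while step < n:
        prev = elems[:]  # snapshot of the previous round
        for i in range(step, n):
            elems[i] = combine(prev[i - step], prev[i])
        step *= 2
    # The accumulated offset of each prefix equals h_t when h_{-1} = 0.
    return [b_acc for _, b_acc in elems]
```

Both functions return the same values up to floating-point rounding. Under autograd, a Python-level loop like `sequential_scan` records one dependent graph step per position, which is one plausible reason a scan written this way backpropagates slowly at length 1024.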

Alex-Naxitus · 2024-07-10T09:21:48Z

I had the same problem. I don't suppose you managed to fix it?

kyegomez · 2024-07-12T03:39:40Z

@Alex-Naxitus yes, it should be updated now!

kyegomez · 2024-08-13T23:53:32Z

@Alex-Naxitus @1325116124 It was the backscan that was really slow. Let me know whether it's fixed for you so I can close this issue!

github-actions · 2024-10-13T12:52:33Z

Stale issue message

1325116124 added the bug label (Something isn't working) May 26, 2024

1325116124 assigned kyegomez May 26, 2024

github-actions bot added the no-issue-activity label Oct 13, 2024
