Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

#6938: Implement softplus as a single kernel #8249

Merged
merged 1 commit into from
May 31, 2024
Merged

#6938: Implement softplus as a single kernel #8249

merged 1 commit into from
May 31, 2024

Conversation

esmalTT
Copy link
Contributor

@esmalTT esmalTT commented May 8, 2024

Convert composite softplus operation to a single kernel implementation to improve performance.

Previously, the composite version of softplus took 0.15 ms for an input shape of (32, 32, 5120). After this change, the same input shape now takes approx. 0.01 ms.

Due to problems with register pressure in the GS implementation of this kernel, we have decided to remove support for that architecture in this PR. Because mish uses softplus internally, it will also no longer be supported on Grayskull. I will open an issue to track this regression.

Blocking issues:

Closes #6938

@esmalTT esmalTT added the mamba label May 8, 2024
@esmalTT esmalTT self-assigned this May 8, 2024
@esmalTT esmalTT changed the title Fuse softplus operation #6938: Fuse softplus operation May 8, 2024
@esmalTT esmalTT force-pushed the esmal/softplus branch 14 times, most recently from 7aa78e1 to 50c79d5 Compare May 13, 2024 11:58
@esmalTT esmalTT force-pushed the esmal/softplus branch 7 times, most recently from 04a5331 to 453df35 Compare May 15, 2024 13:46
@esmalTT esmalTT merged commit 0598421 into main May 31, 2024
5 checks passed
@esmalTT esmalTT deleted the esmal/softplus branch June 5, 2024 00:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Faster softplus needed
5 participants