register fused rmsnorm as pytorch custom op #296
base: gh/tianyu-l/11/base
Conversation
[ghstack-poisoned]
ghstack-source-id: 401d968feaa2e58eedb573c07739694358a8d4a6 Pull Request resolved: #296
Force-pushed 17cda29 to e34d2ac
Force-pushed 1959d5f to c3daa9a
n00b question, but is there any benefit in making a Triton function a custom op? User-defined Triton functions should just work with compile.
@msaroufim Hmm, I don't know much. Maybe it depends on the way the Triton function is wrapped? In this PR there is an autograd.Function wrapping the Triton function. cc: @Chillee @lessw2020 for more context. Beyond compile, making it a custom op can be helpful for other things, e.g. registering customized DTensor sharding propagation rules, allowing PP tracing to work (although we no longer have it in torchtitan), etc.
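For anyone following along, here is a minimal sketch (not the code from this PR) of registering a fused kernel as a PyTorch custom op via torch.library; the op name `mylib::fused_rms_norm` and the eager fallback body are hypothetical placeholders, and the real version would launch the fused Triton kernel and register an autograd formula as well.

```python
import torch

# Hypothetical op name; not the name used in this PR.
@torch.library.custom_op("mylib::fused_rms_norm", mutates_args=())
def fused_rms_norm(x: torch.Tensor, weight: torch.Tensor, eps: float) -> torch.Tensor:
    # The real implementation would launch the fused Triton kernel here;
    # a plain eager RMSNorm is used for illustration only.
    variance = x.pow(2).mean(-1, keepdim=True)
    return weight * x * torch.rsqrt(variance + eps)

@fused_rms_norm.register_fake
def _(x, weight, eps):
    # Shape/dtype propagation for FakeTensor-based tracing and compile.
    return torch.empty_like(x)

# Usage:
# y = torch.ops.mylib.fused_rms_norm(x, w, 1e-6)
```

Registering the kernel as an op gives tracing, FakeTensor propagation, and DTensor sharding rules a single stable operator to target, rather than a Python-level autograd.Function that compile has to see through.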
Should we perhaps just remove this custom kernel? I believe RMSNorm should now work in the PyTorch nightlies.
@msaroufim Oh, which RMSNorm are you referring to?
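(For reference, recent PyTorch releases do expose an RMSNorm primitive directly; a minimal usage sketch, assuming PyTorch 2.4+ and normalization over the last dimension:)

```python
import torch
import torch.nn.functional as F

x = torch.randn(2, 8, 16)
weight = torch.ones(16)

# Functional RMSNorm over the last dimension; a module form exists as torch.nn.RMSNorm(16).
y = F.rms_norm(x, normalized_shape=(16,), weight=weight, eps=1e-6)
```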
Stack from ghstack (oldest at bottom):