[MatchTargetSize] Extend for-loop canonicalization pattern #2045

jopperm · 2024-08-29T14:03:16Z

Support canonicalization of dependent scf.for loops by re-gluing individual results after the loop.

See #1947 for more context / the complete PoC.

Signed-off-by: Julian Oppermann <[email protected]>

third_party/intel/lib/TritonIntelGPUTransforms/MatchTargetSize.cpp

Dewei-Wang-sh · 2024-09-02T02:01:04Z

+
+ // If a result is used by another scf.for loop, we re-glue the individual
+ // results together to allow canonicalization of the dependent loop, too.
+ llvm::SmallDenseMap<OpResult, Value> reglueMap;


We can extend the new condition by handling the scf.yield operation instead; there is no need to add a map.
If the scf.yield has an operand whose defining op is a glue, split it into two operands.

jopperm · 2024-09-02

I've simplified the code to always insert glue ops after the loop and to replace results immediately, without storing them in a map.
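The simplified approach can be illustrated with a toy model. The following is a minimal sketch in plain C++; it is not the code from MatchTargetSize.cpp and does not use MLIR. `Value`, `Loop`, `makeValue`, `makeGlue`, `makeLoop`, `canCanonicalize`, and `canonicalize` are hypothetical names invented for illustration. The sketch assumes that a loop is canonicalizable when one of its init args is defined by a glue, and that canonicalization replaces such an arg with its individual parts.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <utility>
#include <vector>

struct Value;
using ValuePtr = std::shared_ptr<Value>;

// Toy stand-in for an SSA value (hypothetical, not an MLIR type).
// A value with non-empty `parts` models the result of a glue op.
struct Value {
  std::string name;
  std::vector<ValuePtr> parts;
  bool isGlue() const { return !parts.empty(); }
};

ValuePtr makeValue(const std::string &name) {
  return std::make_shared<Value>(Value{name, {}});
}

ValuePtr makeGlue(const std::string &name, std::vector<ValuePtr> parts) {
  return std::make_shared<Value>(Value{name, std::move(parts)});
}

// Toy stand-in for a loop: one result per init arg.
struct Loop {
  std::string name;
  std::vector<ValuePtr> inits;
  std::vector<ValuePtr> results;
};

Loop makeLoop(const std::string &name, std::vector<ValuePtr> inits) {
  Loop loop{name, std::move(inits), {}};
  for (std::size_t i = 0; i < loop.inits.size(); ++i)
    loop.results.push_back(makeValue(name + ".res" + std::to_string(i)));
  return loop;
}

// Assumption: the pattern applies when some init arg is a glue.
bool canCanonicalize(const Loop &loop) {
  for (const ValuePtr &init : loop.inits)
    if (init->isGlue())
      return true;
  return false;
}

// Builds a new loop that carries the individual parts of every glued init arg.
// For each old result, `replacements` receives the value that takes its place:
// a fresh glue of the new results (always created after the loop), or the new
// result itself if the arg was not glued. No map from old results is kept.
Loop canonicalize(const Loop &old, std::vector<ValuePtr> &replacements) {
  std::vector<ValuePtr> newInits;
  for (const ValuePtr &init : old.inits) {
    if (init->isGlue())
      newInits.insert(newInits.end(), init->parts.begin(), init->parts.end());
    else
      newInits.push_back(init);
  }
  Loop newLoop = makeLoop(old.name + ".split", std::move(newInits));

  replacements.clear();
  std::size_t idx = 0;
  for (std::size_t i = 0; i < old.inits.size(); ++i) {
    const ValuePtr &init = old.inits[i];
    if (init->isGlue()) {
      std::size_t width = init->parts.size();
      std::vector<ValuePtr> parts(newLoop.results.begin() + idx,
                                  newLoop.results.begin() + idx + width);
      replacements.push_back(
          makeGlue(old.results[i]->name + ".reglued", std::move(parts)));
      idx += width;
    } else {
      replacements.push_back(newLoop.results[idx]);
      idx += 1;
    }
  }
  return newLoop;
}
```

In this model, a second loop whose init arg is one of the returned replacements sees a glue again, so `canCanonicalize` holds for it as well. That is the property the re-gluing is meant to provide for dependent loops.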

Dewei-Wang-sh

LGTM

[MatchTargetSize] Extend for-loop canonicalization pattern

a285c54

Signed-off-by: Julian Oppermann <[email protected]>

jopperm requested review from whitneywhtsang, etiotto, Dewei-Wang-sh and a team August 29, 2024 14:03

jopperm self-assigned this Aug 29, 2024

jopperm linked an issue Aug 29, 2024 that may be closed by this pull request

[#6 Attention Performance] extend attention support for Causal = True #1102

Open

jopperm added 2 commits August 29, 2024 16:03

Merge branch 'llvm-target' into jopperm/mts-dependent-loops

e908495

Merge branch 'llvm-target' into jopperm/mts-dependent-loops

9d7df40

etiotto reviewed Aug 29, 2024

Two review threads on third_party/intel/lib/TritonIntelGPUTransforms/MatchTargetSize.cpp (outdated, resolved)

jopperm added 2 commits August 29, 2024 18:52

Merge branch 'llvm-target' into jopperm/mts-dependent-loops

c0d2e87

Nits.

173dd19

victor-eds approved these changes Aug 30, 2024

Dewei-Wang-sh requested changes Sep 2, 2024

jopperm added 3 commits September 2, 2024 15:25

Always insert glue ops after the loop.

57c160b

Format.

bf5a11b

Merge branch 'llvm-target' into jopperm/mts-dependent-loops

8a95fc6

jopperm requested review from Dewei-Wang-sh and etiotto September 4, 2024 07:11

Dewei-Wang-sh approved these changes Sep 9, 2024

Dewei-Wang-sh merged commit 66bf5d8 into llvm-target Sep 9, 2024
4 checks passed

whitneywhtsang deleted the jopperm/mts-dependent-loops branch September 9, 2024 21:11
