[TTIG-TO-LLVM] Support row-vector broadcasts and make_range #2046

jopperm · 2024-08-29T15:17:19Z

Minimum-viable support for lowering broadcasts and tt.make_range ops on the advance path.

See #1947 for more context / the complete PoC.

…nce path Signed-off-by: Julian Oppermann <[email protected]>

third_party/intel/lib/TritonIntelGPUToLLVM/TypeConverter.cpp

third_party/intel/lib/TritonIntelGPUToLLVM/PipelineManager.h

third_party/intel/lib/TritonIntelGPUToLLVM/TritonOpsToLLVM.cpp

whitneywhtsang · 2024-09-12T03:31:13Z

third_party/intel/lib/TritonIntelGPUToLLVM/TritonOpsToLLVM.cpp

@@ -573,8 +579,36 @@ class BroadcastOpConversion
 LogicalResult
 matchAndRewrite(triton::BroadcastOp op, OpAdaptor adaptor,
 ConversionPatternRewriter &rewriter) const override {
- rewriter.replaceOp(op, adaptor.getSrc());
- return success();
+ constexpr unsigned subgroupSize = 16;


would it be better to get subgroup size from module?

I'm querying the triton_intel_gpu.min_sg_size attribute now, is that the correct one?

I would query triton::gpu::TritonGPUDialect::getThreadsPerWarp(mod), as the minimum one may not be the selected one, although it always is on the advanced path.

On the advanced path, threads-per-warp is always 1 IIRC.

Not at this stage, I can see that multiple patterns query getThreadsPerWarp in this file.

You're right of course. Fixed.

Dewei-Wang-sh · 2024-09-12T06:06:27Z

third_party/intel/lib/TritonIntelGPUToLLVM/TritonOpsToLLVM.cpp

+ // FIXME: The real lowering has to take the layout into account. Here, we're
+ // just emitting a sequence of ints. Use
+ // `third_party/intel/lib/TritonIntelGPUToLLVM/MakeRangeOpToLLVM.cpp`
+ // instead!


typically, we have a separate conversion for triton ops, that's why this file stands.
what's meaning here?

I've rephrased the comment to a bit to explain the lowering to a sequence of ints is the correct lowering for the advanced path, assuming dense layouts there.

third_party/intel/lib/TritonIntelGPUToLLVM/TritonOpsToLLVM.cpp

[[TTIG-TO-LLVM]] Support row-vector broadcasts and make_range on adva…

4a55acb

…nce path Signed-off-by: Julian Oppermann <[email protected]>

jopperm requested review from whitneywhtsang, etiotto, Dewei-Wang-sh and a team August 29, 2024 15:17

jopperm self-assigned this Aug 29, 2024

jopperm changed the title ~~[[TTIG-TO-LLVM]] Support row-vector broadcasts and make_range~~ [TTIG-TO-LLVM] Support row-vector broadcasts and make_range Aug 29, 2024

jopperm linked an issue Aug 29, 2024 that may be closed by this pull request

[#6 Attention Performance] extend attention support for Causal = True #1102

Open

Merge branch 'llvm-target' into jopperm/broadcast-makerange-to-llvm

3873bc8

Dewei-Wang-sh reviewed Sep 2, 2024

View reviewed changes

third_party/intel/lib/TritonIntelGPUToLLVM/TypeConverter.cpp Outdated Show resolved Hide resolved

victor-eds approved these changes Sep 2, 2024

View reviewed changes

etiotto reviewed Sep 5, 2024

View reviewed changes

third_party/intel/lib/TritonIntelGPUToLLVM/PipelineManager.h Outdated Show resolved Hide resolved

third_party/intel/lib/TritonIntelGPUToLLVM/TritonOpsToLLVM.cpp Outdated Show resolved Hide resolved

jopperm added 4 commits September 11, 2024 12:30

Merge branch 'llvm-target' into jopperm/broadcast-makerange-to-llvm

a8171f9

Drop redundant boolean arg.

5263e0a

Drop TypeConverter change.

0b1acb6

Clean diff.

552bb8d

whitneywhtsang approved these changes Sep 12, 2024

View reviewed changes

Dewei-Wang-sh reviewed Sep 12, 2024

View reviewed changes

third_party/intel/lib/TritonIntelGPUToLLVM/TritonOpsToLLVM.cpp Outdated Show resolved Hide resolved

jopperm added 3 commits September 12, 2024 09:11

Use of auto; comment.

c0e4b8d

Get SG size from module.

98b9f81

WS.

a7a8be9

Dewei-Wang-sh approved these changes Sep 12, 2024

View reviewed changes

jopperm added 2 commits September 12, 2024 10:27

Get SG size from module.

8024e82

Use threads per warp.

ea9878e

jopperm merged commit 389f5dd into llvm-target Sep 12, 2024
4 checks passed

jopperm deleted the jopperm/broadcast-makerange-to-llvm branch September 12, 2024 15:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TTIG-TO-LLVM] Support row-vector broadcasts and make_range #2046

[TTIG-TO-LLVM] Support row-vector broadcasts and make_range #2046

jopperm commented Aug 29, 2024

whitneywhtsang Sep 12, 2024

jopperm Sep 12, 2024

jopperm Sep 12, 2024

whitneywhtsang Sep 12, 2024

jopperm Sep 12, 2024

whitneywhtsang Sep 12, 2024

jopperm Sep 12, 2024

Dewei-Wang-sh Sep 12, 2024

jopperm Sep 12, 2024

[TTIG-TO-LLVM] Support row-vector broadcasts and make_range #2046

[TTIG-TO-LLVM] Support row-vector broadcasts and make_range #2046

Conversation

jopperm commented Aug 29, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment