[Coalesce]: Enhance the Intel coalescing pass to support while loops. #4290

etiotto · 2025-05-23T16:02:33Z

Enhance the Intel GPU coalescing pass to handle scf::WhileOp.

Copilot

Pull Request Overview

Enhance the Intel GPU coalescing and layout-propagation passes to handle scf::WhileOp and generalize descriptor-to-pointer lowering.

Extend findDefiningMakeTensorPtrOp, propagation, and templated loops in Coalesce.cpp to support scf::WhileOp and scf::ConditionOp.
Add updateAdvanceOpChain in RemoveLayoutConversions.cpp for chained AdvanceOps and insert module verification asserts.
Refactor TensorDescToBlockPointer.cpp to simplify descriptor rewriting, unify pointer creation, and remove legacy helper functions.
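
To make the first bullet concrete, here is a minimal sketch of the idea behind tracing a pointer back through a loop-carried argument to the operation that created it. It is a toy model, not the code in Coalesce.cpp: the `Node` struct, the `Kind` enum, the `exampleIr` helper and this standalone `findDefiningMakeTensorPtr` are all hypothetical stand-ins that use plain integer indices instead of MLIR values. It also collapses the two regions of scf::WhileOp into a single "loop argument" node whose `init` field points at the value passed in from outside the loop.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy IR (hypothetical, not MLIR): each node is identified by its index.
enum class Kind { MakeTensorPtr, Advance, WhileArg, Other };

struct Node {
  Kind kind;
  int operand; // Advance: index of the pointer being advanced, else -1
  int init;    // WhileArg: index of the value that initialises the loop
               // argument from outside the loop, else -1
};

// Walk backwards from `v` until a MakeTensorPtr node is found.
// Returns the index of that node, or -1 if the chain ends elsewhere.
int findDefiningMakeTensorPtr(const std::vector<Node> &ir, int v) {
  // Bound the number of steps so a malformed cyclic chain cannot hang.
  for (std::size_t steps = 0; steps <= ir.size(); ++steps) {
    if (v < 0 || static_cast<std::size_t>(v) >= ir.size())
      return -1;
    const Node &n = ir[v];
    switch (n.kind) {
    case Kind::MakeTensorPtr:
      return v; // found the defining operation
    case Kind::Advance:
      v = n.operand; // look through the advance to its base pointer
      break;
    case Kind::WhileArg:
      v = n.init; // look through the loop argument to its initial value
      break;
    case Kind::Other:
      return -1; // unsupported producer
    }
  }
  return -1;
}

// A pointer created outside a while loop, carried into it, then advanced.
std::vector<Node> exampleIr() {
  return {
      {Kind::MakeTensorPtr, -1, -1}, // 0: pointer created before the loop
      {Kind::WhileArg, -1, 0},       // 1: loop argument initialised with 0
      {Kind::Advance, 1, -1},        // 2: advance of the loop argument
      {Kind::Other, -1, -1},         // 3: unrelated value
  };
}
```

Calling `findDefiningMakeTensorPtr(exampleIr(), 2)` walks from the advance to the loop argument to its initial value and returns index 0. In the real pass the traversal would operate on MLIR values and would also have to account for operands forwarded by scf::ConditionOp, which this sketch does not model.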

Reviewed Changes

Copilot reviewed 3 out of 7 changed files in this pull request and generated 2 comments.

File	Description
third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp	Add `updateAdvanceOpChain` and rewrite store logic with verification asserts
third_party/intel/lib/TritonIntelGPUTransforms/Coalesce.cpp	Templatize propagation, add `WhileOp`/`ConditionOp` support, refactor debug logging
third_party/intel/lib/Dialect/Triton/Transforms/TensorDescToBlockPointer.cpp	Overhaul descriptor-to-block-pointer pass, unify `MakeTensorPtrOp` creation

Files not reviewed (4)

test/Triton/Intel/TensorDescToBlockPointer/basic.mlir: Language not supported
test/Triton/Intel/TensorDescToBlockPointer/loop.mlir: Language not supported
test/TritonIntelGPU/backward_combine_dpas_dot_layout.mlir: Language not supported
test/TritonIntelGPU/coalesce.mlir: Language not supported

Comments suppressed due to low confidence (2)

third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp:793

The variable value is undefined here; it should reference storeOp.getValue() or another valid operand value.

Value dataToStore = getValueAs(value, encoding);

third_party/intel/lib/Dialect/Triton/Transforms/TensorDescToBlockPointer.cpp:149

Always pushing a zero offset discards the original op.getIndices(). Use the descriptor's indices for offsets instead of a constant zero.

offsets.push_back(zero);

third_party/intel/lib/TritonIntelGPUTransforms/Coalesce.cpp

whitneywhtsang

Please create an issue to track upstreaming while-loop support in the coalesce pass.

test/TritonIntelGPU/coalesce.mlir

third_party/intel/lib/TritonIntelGPUTransforms/Coalesce.cpp

whitneywhtsang

Ettore agreed to revert back to the old debug print lines if we can reuse functions in lib/Dialect/TritonGPU/Transforms/Coalesce.cpp.

whitneywhtsang · 2025-05-26T14:37:02Z

Please create an issue to track upstreaming while-loop support in the coalesce pass.

Created #4307

etiotto added 12 commits May 9, 2025 17:55

Ensure block ptr is created with the same layout as the descriptor_lo…

4384ad1

…ad/descriptor_store operation that uses it Signed-off-by: Tiotto, Ettore <[email protected]>

Remove naked print and unnecessary headers

f0ce91c

Signed-off-by: Tiotto, Ettore <[email protected]>

Merge remote-tracking branch 'origin/main' into etiotto.tensor_desc_t…

04f1c1d

…o_block_ptr.1

WIP: TensorDescToBlockPtr updates

3543e6e

Signed-off-by: Tiotto, Ettore <[email protected]>

WIP: RemoveLayoutConversions improvement for tt.advance operation

38ef6c3

Signed-off-by: Tiotto, Ettore <[email protected]>

Merge remote-tracking branch 'origin/main' into etiotto.tensor_desc_t…

e4d5d7d

…o_block_ptr.1

WIP: TensorDescToBlockPtr updates

9eb16ec

Signed-off-by: Tiotto, Ettore <[email protected]>

WIP: TensorDescToBlockPtr updates

7f9bbc9

Signed-off-by: Tiotto, Ettore <[email protected]>

WIP: TensorDescToBlockPtr updates

f6ed66a

Signed-off-by: Tiotto, Ettore <[email protected]>

WIP: TensorDescToBlockPtr updates

cb4bb2e

Signed-off-by: Tiotto, Ettore <[email protected]>

Merge branch 'main' into etiotto.tensor_desc_to_block_ptr.1

f6ce50a

Enhance coalescing pass to support scf::While loop

d1c3d0b

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested a review from Copilot May 23, 2025 16:02

etiotto self-assigned this May 23, 2025

Copilot AI reviewed May 23, 2025

third_party/intel/lib/TritonIntelGPUTransforms/Coalesce.cpp (resolved)

third_party/intel/lib/TritonIntelGPUTransforms/Coalesce.cpp (resolved)

etiotto linked an issue May 23, 2025 that may be closed by this pull request

[Coalescing] Failure to support scf::while loops. #4291

Closed

etiotto added 3 commits May 23, 2025 16:41

Merge branch 'main' into etiotto.coalesce.1

1f4ec74

Merge branch 'main' into etiotto.coalesce.1

1cc7fd5

Remove unnecessary changes

d4c2655

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested review from alexbaden, whitneywhtsang, chengjunlu and a team May 23, 2025 17:20

etiotto marked this pull request as ready for review May 23, 2025 17:20

whitneywhtsang reviewed May 23, 2025

test/TritonIntelGPU/coalesce.mlir (resolved)

third_party/intel/lib/TritonIntelGPUTransforms/Coalesce.cpp (resolved)

third_party/intel/lib/TritonIntelGPUTransforms/Coalesce.cpp (outdated, resolved)

Address code review comments

65dc3c3

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested a review from whitneywhtsang May 26, 2025 13:04

etiotto enabled auto-merge (squash) May 26, 2025 14:27

whitneywhtsang approved these changes May 26, 2025

etiotto merged commit 9e6f975 into main May 26, 2025
15 checks passed

etiotto deleted the etiotto.coalesce.1 branch May 26, 2025 14:33

whitneywhtsang mentioned this pull request May 26, 2025

[CoalescePass] Add while loop support in common #4307

Open

etiotto commented May 23, 2025 (edited by whitneywhtsang)
