JIT: Always compute loop iteration estimate in loop inversion if we have PGO data #116104

amanasifkhalid · 2025-05-29T17:37:13Z

Ensure loop inversion always comes up with a loop iteration estimate better than BB_LOOP_WEIGHT_SCALE if we have PGO data.

Copilot

Pull Request Overview

This PR updates the loop inversion logic to skip inverting loops that are expected to iterate only a few times, based on profile weight data.

Simplifies the iteration count estimation by using the likely weight of the test block and the called count.
Removes the previous, more complex handling of profile weights and loop entry estimation.

Copilot · 2025-05-29T17:37:36Z

src/coreclr/jit/optimizer.cpp

+    const bool haveProfileWeights = fgIsUsingProfileWeights();
+    if (haveProfileWeights)
+    {
+        loopIterations = bTest->GetTrueEdge()->getLikelyWeight() / BasicBlock::getCalledCount(this);


Consider adding a check to ensure that BasicBlock::getCalledCount(this) does not return zero before performing the division to avoid a potential division by zero error.

Suggested change

loopIterations = bTest->GetTrueEdge()->getLikelyWeight() / BasicBlock::getCalledCount(this);

const auto calledCount = BasicBlock::getCalledCount(this);

if (calledCount == 0)

{

JITDUMP("BasicBlock::getCalledCount returned zero, skipping division.\n");

return false;

}

loopIterations = bTest->GetTrueEdge()->getLikelyWeight() / calledCount;

BasicBlock::getCalledCount is guaranteed to not return zero.

The old computation was loop-entry relative, and I think that is the right think of the iteration count.

Eg for (int i = 0; i < 11; i++)'s iteration count should always be 11.

I've reverted this to the previous computation (should be a no-diff change now), but out of curiosity, why shouldn't we give nested loops additional weight?

dotnet-policy-service · 2025-05-29T17:37:57Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

amanasifkhalid · 2025-05-29T19:31:27Z

cc @dotnet/jit-contrib, @AndyAyersMS PTAL. Diffs show large size decreases (with libraries_tests being the outlier), as well as some size increases from RBO being pessimized by less branch duplication. I'm not sure what the cutoff for inversion should be, so if these diffs seem too big, I can reduce it a bit.

It's worth noting that I'm not cutting off any loops when we don't have PGO data. Since inversion currently runs before optSetBlockWeights, I don't think I can make any assumptions about loop iteration counts here.

AndyAyersMS · 2025-05-29T21:19:41Z

I think this is a tricky one to get right.

If a loop has low average iteration count it can still have instances with high iteration counts.
If a loop has low iteration counts the method with the loop may be called frequently, or the loop may be inside another loop with high iteration counts, etc.

amanasifkhalid · 2025-05-29T21:59:47Z

or the loop may be inside another loop with high iteration counts

In this case, wouldn't we compute a high iteration count for the nested loop too (assuming the parent loop doesn't conditionally execute the child loop)?

I agree that this approach isn't sensitive to the other cases you mentioned. The loop inversion diffs that inspired this change didn't necessarily involve loops with low iteration counts; rather, they were loops that are more likely to fall through than loop, or otherwise weren't likely to run more than once per method call. It feels a bit crude, but I could take a safer approach and skip loops that don't iterate at least twice on average -- in other words, it has to behave like a loop on average to be inverted.

AndyAyersMS · 2025-05-29T22:25:21Z

or the loop may be inside another loop with high iteration counts

In this case, wouldn't we compute a high iteration count for the nested loop too (assuming the parent loop doesn't conditionally execute the child loop)?

Ah, I should have looked more closely. You are computing a method-entry relative count, not a loop-entry relative count... I just assumed "iteration count" meant the latter.

So yes what you are doing would handle the nested case ok.

I'd like to see what a size-based heuristic looks like. I think that is perhaps less prone to mis-estimating importance or potential benefit from inversion (?).

amanasifkhalid · 2025-05-29T22:30:11Z

I'd like to see what a size-based heuristic looks like.

I was thinking of reusing the size heuristic you added for loop cloning: If a loop is too big to likely benefit from cloning, then it's probably not tight enough to benefit from inversion. Does that seem like a reasonable starting point?

I don't think we can easily separate out the size heuristic change from #116017, since we need the loop data structures computed to easily compute the loop size. I can push a change to that PR with the size restriction and see how the diffs change.

AndyAyersMS · 2025-05-29T22:36:32Z

I'd like to see what a size-based heuristic looks like.

I was thinking of reusing the size heuristic you added for loop cloning: If a loop is too big to likely benefit from cloning, then it's probably not tight enough to benefit from inversion. Does that seem like a reasonable starting point?

Sure, using the same size threshold seems reasonable.

amanasifkhalid · 2025-05-30T19:41:55Z

Based on my trial and error with different size limits for loop inversion (comment), I think we're unlikely to pursue a loop iteration heuristic for now. I'm going to remove the heuristic portion and just make this into a refactor of the loop iteration computation, so that we're at least always doing it.

…manasifkhalid/runtime into loop-inversion-iteration-count

amanasifkhalid added 2 commits May 29, 2025 13:08

More precise loop iteration computation

672b2c0

Skip loops with low iteration counts

d8e96d6

Copilot AI review requested due to automatic review settings May 29, 2025 17:37

github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label May 29, 2025

dotnet-policy-service bot assigned amanasifkhalid May 29, 2025

Copilot AI reviewed May 29, 2025

View reviewed changes

Comments

464d8e4

Merge branch 'main' into loop-inversion-iteration-count

21503d1

build-analysis bot mentioned this pull request May 30, 2025

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

3 tasks

Remove heuristic

482aec8

amanasifkhalid changed the title ~~JIT: Don't invert loops with low iterations counts~~ JIT: Always compute loop iteration estimate in loop inversion if we have PGO data May 30, 2025

amanasifkhalid added 3 commits May 30, 2025 15:52

Merge branch 'loop-inversion-iteration-count' of https://github.com/a…

9aee3aa

…manasifkhalid/runtime into loop-inversion-iteration-count

Merge branch 'main' into loop-inversion-iteration-count

c08588c

Use loop entry-relative iteration computation

0e0ba90

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

JIT: Always compute loop iteration estimate in loop inversion if we have PGO data #116104

JIT: Always compute loop iteration estimate in loop inversion if we have PGO data #116104

amanasifkhalid commented May 29, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI May 29, 2025

Uh oh!

amanasifkhalid May 29, 2025

Uh oh!

AndyAyersMS May 30, 2025

Uh oh!

amanasifkhalid Jun 2, 2025

Uh oh!

dotnet-policy-service bot commented May 29, 2025

Uh oh!

amanasifkhalid commented May 29, 2025 •

edited

Loading

Uh oh!

AndyAyersMS commented May 29, 2025

Uh oh!

amanasifkhalid commented May 29, 2025

Uh oh!

AndyAyersMS commented May 29, 2025

Uh oh!

amanasifkhalid commented May 29, 2025

Uh oh!

AndyAyersMS commented May 29, 2025

Uh oh!

amanasifkhalid commented May 30, 2025

Uh oh!

Uh oh!

-        loopIterations = bTest->GetTrueEdge()->getLikelyWeight() / BasicBlock::getCalledCount(this);
+        const auto calledCount = BasicBlock::getCalledCount(this);
+        if (calledCount == 0)
+        {
+            JITDUMP("BasicBlock::getCalledCount returned zero, skipping division.\n");
+            return false;
+        }
+        loopIterations = bTest->GetTrueEdge()->getLikelyWeight() / calledCount;

JIT: Always compute loop iteration estimate in loop inversion if we have PGO data #116104

Are you sure you want to change the base?

JIT: Always compute loop iteration estimate in loop inversion if we have PGO data #116104

Conversation

amanasifkhalid commented May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI May 29, 2025

Choose a reason for hiding this comment

Uh oh!

amanasifkhalid May 29, 2025

Choose a reason for hiding this comment

Uh oh!

AndyAyersMS May 30, 2025

Choose a reason for hiding this comment

Uh oh!

amanasifkhalid Jun 2, 2025

Choose a reason for hiding this comment

Uh oh!

dotnet-policy-service bot commented May 29, 2025

Uh oh!

amanasifkhalid commented May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AndyAyersMS commented May 29, 2025

Uh oh!

amanasifkhalid commented May 29, 2025

Uh oh!

AndyAyersMS commented May 29, 2025

Uh oh!

amanasifkhalid commented May 29, 2025

Uh oh!

AndyAyersMS commented May 29, 2025

Uh oh!

amanasifkhalid commented May 30, 2025

Uh oh!

Uh oh!

amanasifkhalid commented May 29, 2025 •

edited

Loading

amanasifkhalid commented May 29, 2025 •

edited

Loading