JIT: Try to maintain fallthrough in `Compiler::fgSplitEdge` #107419

amanasifkhalid · 2024-09-05T18:28:16Z

Given some branch curr -> succ, fgSplitEdge introduces an intermediary block to create the form curr -> newBlock -> succ. The lexical placement of newBlock wouldn't matter if it weren't for the fact that we call fgSplitEdge during LSRA, after we've reordered blocks. Ideally, we wouldn't introduce any new blocks after establishing layout, but for the time being, always placing newBlock after curr to create fallthrough -- effectively moving the curr -> succ branch up a block, and not introducing new branches -- seems to be a decent compromise.

dotnet-policy-service · 2024-09-05T18:28:44Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

amanasifkhalid · 2024-09-06T16:11:37Z

cc @dotnet/jit-contrib, @AndyAyersMS PTAL. Diffs are large, though they're inflated by collections with tiering (libraries_tests in particular). This change in placement is particularly impactful when we aren't optimizing, so we can't rely on block layout to fix things later -- though it looks like there are plenty of instances where LSRA introduces a new block after layout when we are optimizing, and this new placement is better at maintaining fallthrough.

AndyAyersMS · 2024-09-06T16:55:43Z

Seems like perhaps fgSplitEdge should behave differently before/after layout? Before the placement is not important, after, it is...?

That being said, I have seen those LSRA blocks end up very poorly placed, and we might question whether after source or before target is the right location for the new block, when source and target blocks are not adjacent, or whether we just (as we've discussed) redo layout if LSRA makes changes.

amanasifkhalid · 2024-09-06T17:09:50Z

I suppose we could differentiate between before/after layout, though since we're seeing this block placement affect FullOpts with some frequency, it might be more worthwhile to get layout working after LSRA.

That being said, I have seen those LSRA blocks end up very poorly placed, and we might question whether after source or before target is the right location for the new block

Between the two, I think placing after source is better most of the time? If the target has multiple preds, placing before target could potentially break up hotter fallthrough. If the source has only one successor, then placing after the source doesn't change anything, but if the source is a BBJ_COND, then the worst-case scenario is we had some layout like source, falseTarget, target, and now we have source, newBlock, falseTarget, target; we no longer have fallthrough into either successor of source.

If you'd like, I can put this PR on-hold and work on getting layout working after LSRA. For that, I'm thinking we can continue to run layout before LSRA, and detect if we need to run it again after, just to minimize regressions. Once we've refactored switch recognition to not depend on lexical layout (#107076), we should be able to only run layout once, after LSRA.

AndyAyersMS · 2024-09-06T17:25:36Z

I think it makes sense to look into fixing layout post LSRA first. If that turns out to be complicated, then perhaps we can reconsider and do this first.

I believe LSRA will only split "critical edges"— edges where the source has multiple flow successors and the target has multiple flow predecessors. If we trust the initial layout then if source and target are adjacent then putting the block in between is the right thing; if not, presumably both source and target have other preferred partners and if those partners are adjacent then the new block should be placed out of the way somewhere where it won't break up any other important flow (though not necessarily at the end of the method / region, which is what I commonly see). If the preferred source or target partner is not adjacent (or maybe if no successor/predecessor is adjacent), then we should be placing after source or before target depending.

This calculus would be easier if we hand the flow scoring that we are envisioning for k-opt, as we could evaluate the different options and pick the one that has the best score.

amanasifkhalid · 2024-09-10T13:26:28Z

With #107483 merged, the diffs are still big, though FullOpts diffs are concentrated in coreclr_tests and libraries_tests. After looking at the JIT dumps for a few examples, I see that fgSplitEdge's block placement during LSRA, as expected, is no longer meaningful in FullOpts, as block layout handles it. But the call site during profile incorporation is now the source of diffs, thanks to various early phases having dependencies on lexical block ordering. fgUpdateFlowGraph is one obvious culprit -- I can look into refactoring that next.

@AndyAyersMS are you ok taking this change as-is, or would you prefer we whittle down the churn as much as possible?

This reverts commit c65a99b.

Copilot

Pull Request Overview

This PR modifies the JIT's fgSplitEdge method to better maintain fallthrough when splitting edges during LSRA (Linear Scan Register Allocation). The key improvement is placing the new intermediary block more strategically to preserve existing control flow patterns and avoid introducing unnecessary branches.

Improves block placement strategy in fgSplitEdge to maintain fallthrough when possible
Simplifies weight calculation logic by using existing predecessor edge weight directly
Updates documentation to reflect the cleaner implementation

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
src/coreclr/jit/fgbasic.cpp	Refactors `fgSplitEdge` to place new blocks more strategically and simplifies weight calculation
src/coreclr/jit/lsra.cpp	Updates comment to reflect simplified edge splitting behavior

src/coreclr/jit/fgbasic.cpp

amanasifkhalid · 2025-08-05T21:15:11Z

I decided to take another look at this, now that our block layout story is mature. The diffs show large size decreases for both optimized and unoptimized code, though the bulk of these savings are from instrumented tiers. Here's the breakdown:

LSRA calls fgSplitEdge to resolve critical edges. When optimizing, the placement of the new block doesn't matter since block layout will fix it, unless the block is cold, in which case layout ignores it entirely. Thus, placing the new block strategically can yield more compact cold code.
On x64 in Tier1-Instrumented, there are several instances where we now call CORINFO_HELP_COUNTPROFILE32 with a 32-bit argument instead of a 64-bit one, and vice versa. These diffs look spurious, since they have more to do with the profile data's location in memory than the code layout.
fgSplitEdge is also called when inserting instrumentation probes. In Tier0-Instrumented, since blocks won't be reordered, any fallthrough created by fgSplitEdge will be reflected by the final layout.

@AndyAyersMS the churn might be too large to take at this point, but I think this is worth taking at some point, considering the potential size savings.

Maintain fallthrough in fgSplitEdge

c65a99b

ghost added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Sep 5, 2024

dotnet-policy-service bot assigned amanasifkhalid Sep 5, 2024

Typo

5a9cf14

amanasifkhalid mentioned this pull request Sep 6, 2024

JIT: Add simple late layout pass #107483

Merged

Merge branch 'main' into fgSplitEdge

ce7cf33

build-analysis bot mentioned this pull request Sep 11, 2024

restarted. Azure DevOps can't recover from restarts. dotnet/dnceng#3879

Closed

3 tasks

This was referenced Sep 12, 2024

JIT: Flowgraph Modernization and Improved Block Layout in .NET 10 #107749

Closed

JIT: Always use edge weights to assign block weight in Compiler::fgSplitEdge #107941

Merged

amanasifkhalid added 3 commits August 5, 2025 15:24

Revert "Maintain fallthrough in fgSplitEdge"

3747bb1

This reverts commit c65a99b.

Merge branch 'main' into fgSplitEdge

548c020

Maintain fallthrough

8b46f3d

Copilot AI review requested due to automatic review settings August 5, 2025 19:26

Copilot AI reviewed Aug 5, 2025

View reviewed changes

src/coreclr/jit/fgbasic.cpp Show resolved Hide resolved

src/coreclr/jit/fgbasic.cpp Show resolved Hide resolved

Merge branch 'main' into fgSplitEdge

38191d1

build-analysis bot mentioned this pull request Aug 18, 2025

System.Data.OleDb.Tests timeout in net48 x86 Release leg #87783

Closed

JulieLeeMSFT unassigned amanasifkhalid Oct 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

JIT: Try to maintain fallthrough in `Compiler::fgSplitEdge` #107419

JIT: Try to maintain fallthrough in `Compiler::fgSplitEdge` #107419

Uh oh!

amanasifkhalid commented Sep 5, 2024

Uh oh!

dotnet-policy-service bot commented Sep 5, 2024

Uh oh!

amanasifkhalid commented Sep 6, 2024

Uh oh!

AndyAyersMS commented Sep 6, 2024

Uh oh!

amanasifkhalid commented Sep 6, 2024

Uh oh!

AndyAyersMS commented Sep 6, 2024

Uh oh!

amanasifkhalid commented Sep 10, 2024

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

amanasifkhalid commented Aug 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

JIT: Try to maintain fallthrough in Compiler::fgSplitEdge #107419

Are you sure you want to change the base?

JIT: Try to maintain fallthrough in Compiler::fgSplitEdge #107419

Uh oh!

Conversation

amanasifkhalid commented Sep 5, 2024

Uh oh!

dotnet-policy-service bot commented Sep 5, 2024

Uh oh!

amanasifkhalid commented Sep 6, 2024

Uh oh!

AndyAyersMS commented Sep 6, 2024

Uh oh!

amanasifkhalid commented Sep 6, 2024

Uh oh!

AndyAyersMS commented Sep 6, 2024

Uh oh!

amanasifkhalid commented Sep 10, 2024

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

amanasifkhalid commented Aug 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

JIT: Try to maintain fallthrough in `Compiler::fgSplitEdge` #107419

JIT: Try to maintain fallthrough in `Compiler::fgSplitEdge` #107419