JIT: enable instrumentation for inlinees #119658

AndyAyersMS · 2025-09-12T18:03:42Z

If we are doing an optimized+instrumented jit pass (like we do for R2R methods) allow the inlinees to be instrumented. This gives us a chance to collect profile data for methods that are always inlined.

All inlinees currently share the same profile data segment with each other and with a root compilation of the method (if any). So this profile is "context-free".

If there is a schema mismatch (say there is a stale pre-existing R2R schema) then we disregard the old schema and use the new one.

This is mostly just removing assertions and bailouts and bad assumptions. However if the inlinee has an instrumentable call in its return expression we need to temporarily insert it as normal IR so it can get instrumented like anything else. We then undo this after instrumentation. Any residual impact from instrumentation will be left either in prior statements or commas inside the return expression.

Closes #44372. Closes #91938.

If we are doing an optimized+instrumented jit pass (like we do for R2R methods) allow the inlinees to be instrumented. This gives us a chance to collect profile data for methods that are always inlined. All inlinees currently share the same profile data segment with each other and with a root compilation of the method (if any). So this profile is "context-free". If there is a schema mismatch (say there is a stale pre-existing R2R schema) then subsequent instrumentation will fail. This is something we need to keep an eye on. Right now we can't distinguish this failure from other kinds of schema allocation failures. Closes dotnet#44372

AndyAyersMS · 2025-09-12T18:27:25Z

This should address the long-standing regression in #91938. Local runs show it possibly helps but doesn't completely fix the regression. Still need to verify it is actually getting instrumentation data for the key inlinees.

Generally speaking, this should help reduce the gap between running with R2R enabled (where we can lose profile data, or optimize based on stale data) and R2R disabled.

AndyAyersMS · 2025-09-13T01:12:54Z

I have a bunch more fixes on top of this, will PR them soon.

One issue I'm seeing is IL mismatches. Some of it I've tracked down to IL differences between the PGO schema embedded into corelib and the current corelib IL; that's somewhat understandable as the PGO data likely goes stale, and should be (generally) a non-product issue.

Would still be good to flag this case and allow the dynamic PGO to take over.

I also suspect inlinee schemas may diverge from root schemas even for the same IL, but need to do more work to track this down.

AndyAyersMS · 2025-09-16T01:22:41Z

Still some failures to sort out. Also my local perf runs aren't as promising as I'd hoped.

AndyAyersMS · 2025-09-18T17:24:58Z

Ok, think I tracked down the last few issues.

AndyAyersMS · 2025-09-18T17:30:33Z

Local run. Regression fixed. TC now matches no-R2R perf here.

Method	Runtime	Options	Mean	Error	StdDev	Median	Min	Max	Ratio	RatioSD	Allocated	Alloc Ratio
LastIndexOf_Word_NotFound	PR	(en-US, OrdinalIgnoreCase, False)	574.6 ns	9.34 ns	8.74 ns	573.9 ns	562.4 ns	588.8 ns	0.89	0.02	-	NA
LastIndexOf_Word_NotFound	.NET 10.0	(en-US, OrdinalIgnoreCase, False)	784.9 ns	13.04 ns	12.19 ns	785.1 ns	770.8 ns	807.0 ns	1.21	0.02	-	NA
LastIndexOf_Word_NotFound	.NET 7.0	(en-US, OrdinalIgnoreCase, False)	647.1 ns	9.97 ns	9.33 ns	645.9 ns	632.7 ns	666.0 ns	1.00	0.02	-	NA

AndyAyersMS · 2025-09-18T17:38:12Z

@EgorBot --intel --arm --filter System.Globalization.Tests.StringSearch.LastIndexOf_Word_NotFound(Options: (en-US, OrdinalIgnoreCase, False))

EgorBo · 2025-09-18T17:39:33Z

@EgorBot --intel --arm --filter System.Globalization.Tests.StringSearch.LastIndexOf_Word_NotFound(Options: (en-US, OrdinalIgnoreCase, False))

AndyAyersMS · 2025-09-18T17:40:17Z

@EgorBot --intel --arm --filter System.Globalization.Tests.StringSearch.LastIndexOf_Word_NotFound

AndyAyersMS · 2025-09-18T18:32:44Z

/azp run runtime-coreclr libraries-pgo, runtime-coreclr pgostress, runtime-coreclr pgo

azure-pipelines · 2025-09-18T18:33:08Z

Azure Pipelines successfully started running 3 pipeline(s).

Copilot

Pull Request Overview

This PR enables instrumentation for inlined methods (inlinees) when performing optimized+instrumented JIT passes, such as R2R methods. This allows collection of profile data for methods that are always inlined, providing "context-free" profile data.

Key changes:

Removes assertions and bailouts that prevented inlinee instrumentation
Handles schema mismatches by discarding stale data and using new schemas
Adds special handling for inlinee return expressions during instrumentation

Reviewed Changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
src/coreclr/vm/pgo.cpp	Updates PGO allocation to handle schema mismatches by removing stale data and relaying schemas
src/coreclr/vm/jitinterface.cpp	Changes PGO allocation to use method handle instead of method being compiled
src/coreclr/jit/jitconfigvalues.h	Adds JitInstrumentIfOptimizing configuration option
src/coreclr/jit/importercalls.cpp	Removes inlining restrictions for instrumentation and tail call handling
src/coreclr/jit/fgprofile.cpp	Removes inlining restrictions and adds wrapper for inlinee return expression handling
src/coreclr/jit/fginline.cpp	Removes clearing of instrumentation flags for inlinees
src/coreclr/jit/compiler.h	Adds fgInstrumentMethodCore method declaration
src/coreclr/jit/compiler.cpp	Removes inlining assertions and adds debug instrumentation forcing
src/coreclr/inc/pgo_formatprocessing.h	Changes schema compatibility check to return match count instead of boolean

src/coreclr/inc/pgo_formatprocessing.h

src/coreclr/jit/fgprofile.cpp

AndyAyersMS · 2025-09-18T19:05:49Z

@dotnet/jit-contrib PTAL

There will be a sizable code size / tp increase in some collections.

AndyAyersMS · 2025-09-18T21:08:47Z

Libraries-pgo failure looks like a GC hole but suspect it is something pre-existing that we're just uncovering here.

Pgo failure is in stackoverflow tester.

Diffs. As promised large size increases, though the bulk of that is from libraries tests and (not surprisingly) mostly in Tier1-instr.

A decent fraction of context misses too, so the accuracy of SPMI is a bit questionable. I am not really sure how to do a fair assessment here as the new data gathered by this instrumentation should have a big effect on Tier1.

EgorBo · 2025-09-18T22:21:57Z

@EgorBot -amd -arm

using System;
using System.Runtime.CompilerServices;
using System.Runtime.InteropServices;
using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;

public class Prog
{
    static void Main(string[] args)
    {
        BenchmarkSwitcher.FromAssembly(typeof(Prog).Assembly).Run(args);
    }

    byte[] Src = new byte[32];
    byte[] Dst = new byte[32];

    [Benchmark]
    public void Copy() => Src.AsSpan().CopyTo(Dst);
}

AndyAyersMS · 2025-09-19T14:21:10Z

@EgorBo PTAL

EgorBo

Nice!

src/coreclr/inc/pgo_formatprocessing.h

github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Sep 12, 2025

dotnet-policy-service bot assigned AndyAyersMS Sep 12, 2025

build-analysis bot mentioned this pull request Sep 12, 2025

LibraryImportGenerator.Unit.Tests crashing on linux-x64 mono interpreter #100800

Open

AndyAyersMS added 2 commits September 13, 2025 11:52

fixes

4ad2590

fixes

ac9605e

build-analysis bot mentioned this pull request Sep 16, 2025

Sometimes the helix SDK uses GetWorkItemsAsync when workitems aren't done processing. dotnet/dnceng#6011

Open

3 tasks

last? round of fixes

49f11bb

EgorBot mentioned this pull request Sep 18, 2025

Benchmarks for #119658 (AndyAyersMS) EgorBot/runtime-utils#490

Open

EgorBot mentioned this pull request Sep 18, 2025

Benchmarks for #119658 (EgorBo) EgorBot/runtime-utils#491

Open

EgorBot mentioned this pull request Sep 18, 2025

Benchmarks for #119658 (AndyAyersMS) EgorBot/runtime-utils#492

Open

AndyAyersMS marked this pull request as ready for review September 18, 2025 19:01

Copilot AI review requested due to automatic review settings September 18, 2025 19:01

Copilot AI reviewed Sep 18, 2025

View reviewed changes

src/coreclr/inc/pgo_formatprocessing.h Show resolved Hide resolved

src/coreclr/jit/fgprofile.cpp Outdated Show resolved Hide resolved

src/coreclr/jit/fgprofile.cpp Outdated Show resolved Hide resolved

build-analysis bot mentioned this pull request Sep 18, 2025

System.Diagnostics.Tests.PerformanceCounterTests.PerformanceCounter_IncrementBy_IncrementByReadOnly failed with "Attempted to perform an unauthorized operation" #116014

Closed

add config to disable; add stress mode; fix copilot issue

4d71ea3

EgorBot mentioned this pull request Sep 18, 2025

Benchmarks for #119658 (EgorBo) EgorBot/runtime-utils#493

Open

AndyAyersMS requested a review from EgorBo September 19, 2025 14:20

EgorBo approved these changes Sep 21, 2025

View reviewed changes

EgorBo reviewed Sep 21, 2025

View reviewed changes

src/coreclr/inc/pgo_formatprocessing.h Show resolved Hide resolved

AndyAyersMS merged commit ad22abb into dotnet:main Sep 22, 2025
172 checks passed

LoopedBard3 mentioned this pull request Sep 24, 2025

[Perf] Linux/x64: JIT: enable instrumentation for inlinees Regressions #120068

Open

JIT: enable instrumentation for inlinees #119658

JIT: enable instrumentation for inlinees #119658

Uh oh!

Conversation

AndyAyersMS commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AndyAyersMS commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AndyAyersMS commented Sep 13, 2025

Uh oh!

AndyAyersMS commented Sep 16, 2025

Uh oh!

AndyAyersMS commented Sep 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AndyAyersMS commented Sep 18, 2025

Uh oh!

AndyAyersMS commented Sep 18, 2025

Uh oh!

EgorBo commented Sep 18, 2025

Uh oh!

AndyAyersMS commented Sep 18, 2025

Uh oh!

AndyAyersMS commented Sep 18, 2025

Uh oh!

azure-pipelines bot commented Sep 18, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AndyAyersMS commented Sep 18, 2025

Uh oh!

AndyAyersMS commented Sep 18, 2025

Uh oh!

EgorBo commented Sep 18, 2025

Uh oh!

AndyAyersMS commented Sep 19, 2025

Uh oh!

EgorBo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AndyAyersMS commented Sep 12, 2025 •

edited

Loading

AndyAyersMS commented Sep 12, 2025 •

edited

Loading

AndyAyersMS commented Sep 18, 2025 •

edited

Loading