Fix LoaderHeap's free list growing more than expected by eduardo-vp · Pull Request #129203 · dotnet/runtime

eduardo-vp · 2026-06-10T01:22:46Z

I was working with the test in https://github.com/korchak-aleksandr/net10-regression-repro and found out that the free list in UnlockedLoaderHeap grows to thousands of elements, which makes allocations very slow since we do a linear scan of this free list for each one of them.

In these scenarios multiple threads might need the same generic instantiation simultaneously and they all race to create/publish it. Multiple threads can lose the race and quickly add blocks to the free list since they don't need that memory. Subsequent calls that need generic instantiations do a linear scan of the free list to find a memory block to reuse. This ends up taking a lot of time due to its size (can be up to ten of thousands).

This PR stops making thread race to create/publish such that we don't insert several blocks in the free list.

	.NET 10	.NET 10 + this PR
Time	91.2 s	3.9 s
Max RSS	114.1 MB	117.5 MB

Copilot

Pull request overview

This PR restructures UnlockedLoaderHeap’s free list to reduce allocation-time overhead in backout-heavy scenarios by replacing a single linear-scanned free list with size-segregated buckets for common small block sizes plus an overflow list for larger blocks.

Changes:

Replaces m_pFirstFreeBlock with 32 size buckets (pointer-size increments) and a separate “large/overflow” free list.
Updates free-block insertion/allocation logic to use bucketed O(1) reuse for small sizes, and retains linear scanning only for the overflow list (including a stress-log warning on long scans).
Adjusts debug-only free-list dumping and validation to iterate across all buckets and the overflow list.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
src/coreclr/utilcode/loaderheap.cpp	Implements bucket initialization, bucket-aware allocation/insertion, overflow scan warning, and updates debug dump/validation to traverse buckets.
src/coreclr/utilcode/loaderheap_shared.h	Updates `LoaderHeapFreeBlock` API to no longer take an explicit head pointer (heap chooses bucket internally).
src/coreclr/inc/loaderheap.h	Adds bucket/overflow free list fields and related constants to `UnlockedLoaderHeap`.

jkotas · 2026-06-10T01:33:39Z

In these scenarios multiple threads might need the same generic instantiation simultaneously and they all race to create/publish it

Where is this race condition exactly? Can we add a lock there instead?

The freelist in LoaderHeap is meant to be only used in error conditions to backout types that failed to load, or to deal with rare race condition. If you see the freelist growing this much, it means that the loader heap is not used correctly. We should fix that instead.

eduardo-vp · 2026-06-10T01:53:05Z

I'll take a look, this the part where many threads lose

runtime/src/coreclr/vm/genmeth.cpp

Lines 493 to 542 in 24547a7

    
                       InstantiatedMethodDesc *pOldMD = FindLoadedInstantiatedMethodDesc(pExactMT, 
        
                                                                 pGenericMDescInRepMT->GetMemberDef(), 
        
                                                                 methodInst, 
        
                                                                 getWrappedCode, 
        
                                                                 pGenericMDescInRepMT->IsAsyncVariantMethod()); 
        
                       if (pOldMD == NULL) 
        
                       { 
        
                           // No one else got there first, our MethodDesc wins. 
        
                           amt.SuppressRelease(); 
        
           #ifdef _DEBUG 
        
                           SString name; 
        
                           TypeString::AppendMethodDebug(name, pNewMD); 
        
                           const char* pDebugNameUTF8 = name.GetUTF8(); 
        
                           const char* verb = "Created"; 
        
                           if (pWrappedMD) 
        
                               LOG((LF_CLASSLOADER, LL_INFO1000, 
        
                                   "GENERICS: %s instantiating-stub method desc %s with dictionary size %d\n", 
        
                                   verb, pDebugNameUTF8, infoSize)); 
        
                           else 
        
                               LOG((LF_CLASSLOADER, LL_INFO1000, 
        
                                    "GENERICS: %s instantiated method desc %s\n", 
        
                                    verb, pDebugNameUTF8)); 
        
                           S_SIZE_T safeLen = S_SIZE_T(strlen(pDebugNameUTF8))+S_SIZE_T(1); 
        
                           if(safeLen.IsOverflow()) COMPlusThrowHR(COR_E_OVERFLOW); 
        
                           size_t len = safeLen.Value(); 
        
                           pNewMD->m_pszDebugMethodName = (char*) (void*)pAllocator->GetLowFrequencyHeap()->AllocMem(safeLen); 
        
                           _ASSERTE(pNewMD->m_pszDebugMethodName); 
        
                           strcpy_s((char *) pNewMD->m_pszDebugMethodName, len, pDebugNameUTF8); 
        
                           pNewMD->m_pszDebugClassName = pExactMT->GetDebugClassName(); 
        
                           pNewMD->m_pszDebugMethodSignature = (LPUTF8)pNewMD->m_pszDebugMethodName; 
        
           #endif // _DEBUG 
        
                           // Generic methods can't be varargs. code:MethodTableBuilder::ValidateMethods should have checked it. 
        
                           _ASSERTE(!pNewMD->IsVarArg()); 
        
                           // Verify that we are not creating redundant MethodDescs 
        
                           _ASSERTE(!pNewMD->IsTightlyBoundToMethodTable()); 
        
                           // The method desc is fully set up; now add to the table 
        
                           InstMethodHashTable* pTable = pExactMDLoaderModule->GetInstMethodHashTable(); 
        
                           pTable->InsertMethodDesc(pNewMD); 
        
                       } 
        
                       else 
        
                           pNewMD = pOldMD; 
        
                       // CrstHolder goes out of scope here 
        
                   }

This reverts commit 96f0039.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 3 comments.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.

jkotas · 2026-06-15T22:14:55Z

Could you please run a perf test that tries to create many different method instantiations on multiple threads, and see whether taking a lock around MethodDesc creation makes it measurably slower?

If we find that it is getting measurably slower, we may want to do something else about it - e.g. move more of the work outside the lock.

eduardo-vp · 2026-06-16T03:14:15Z

I used a script to create a Gen.cs file and test the creation of 150k different method instantiations.

public static class W
{
    [System.Runtime.CompilerServices.MethodImpl(System.Runtime.CompilerServices.MethodImplOptions.NoInlining)]
    public static long M1<T>() => System.Runtime.CompilerServices.Unsafe.SizeOf<T>();
    [System.Runtime.CompilerServices.MethodImpl(System.Runtime.CompilerServices.MethodImplOptions.NoInlining)]
    public static long M2<T>() => System.Runtime.CompilerServices.Unsafe.SizeOf<T>();
    [System.Runtime.CompilerServices.MethodImpl(System.Runtime.CompilerServices.MethodImplOptions.NoInlining)]
    public static long M3<T>() => System.Runtime.CompilerServices.Unsafe.SizeOf<T>();
    [System.Runtime.CompilerServices.MethodImpl(System.Runtime.CompilerServices.MethodImplOptions.NoInlining)]
    public static long M4<T>() => System.Runtime.CompilerServices.Unsafe.SizeOf<T>();
    [System.Runtime.CompilerServices.MethodImpl(System.Runtime.CompilerServices.MethodImplOptions.NoInlining)]
    public static long M5<T>() => System.Runtime.CompilerServices.Unsafe.SizeOf<T>();
}

public struct S0 { }
public struct S1 { }
public struct S2 { }
public struct S3 { }
public struct S4 { }
// ...
public struct S29998 { }
public struct S29999 { }

public static class Gen
{
    public static readonly System.Action[] Actions = new System.Action[]
    {
        static () => { W.M1<S0>(); W.M2<S0>(); W.M3<S0>(); W.M4<S0>(); W.M5<S0>(); },
        static () => { W.M1<S1>(); W.M2<S1>(); W.M3<S1>(); W.M4<S1>(); W.M5<S1>(); },
        static () => { W.M1<S2>(); W.M2<S2>(); W.M3<S2>(); W.M4<S2>(); W.M5<S2>(); },
        static () => { W.M1<S3>(); W.M2<S3>(); W.M3<S3>(); W.M4<S3>(); W.M5<S3>(); },
        static () => { W.M1<S4>(); W.M2<S4>(); W.M3<S4>(); W.M4<S4>(); W.M5<S4>(); },
        // ...
        static () => { W.M1<S29998>(); W.M2<S29998>(); W.M3<S29998>(); W.M4<S29998>(); W.M5<S29998>(); },
        static () => { W.M1<S29999>(); W.M2<S29999>(); W.M3<S29999>(); W.M4<S29999>(); W.M5<S29999>(); },
    }
}

The main programs just runs a loop

Parallel.For(0, N, new ParallelOptions { MaxDegreeOfParallelism = Workers }, i => actions[i]());

net10 takes ~690 ms while .net10 + this change takes ~705 ms (around 2% slower in this extreme case). I think this should be fine.

Add size-segregated buckets

96f0039

Copilot AI review requested due to automatic review settings June 10, 2026 01:22

Copilot started reviewing on behalf of eduardo-vp June 10, 2026 01:22 View session

github-actions Bot added the area-VM-coreclr label Jun 10, 2026

Copilot AI reviewed Jun 10, 2026

View reviewed changes

Comment thread src/coreclr/utilcode/loaderheap.cpp Outdated

Comment thread src/coreclr/utilcode/loaderheap.cpp Outdated

Comment thread src/coreclr/utilcode/loaderheap_shared.h Outdated

build-analysis Bot mentioned this pull request Jun 10, 2026

Unable to pull image from mcr.microsoft.com #117164

Open

Eduardo Velarde added 2 commits June 9, 2026 23:48

Remove race in NewInstantiatedMethodDesc

f8bc01d

Revert "Add size-segregated buckets"

539beab

This reverts commit 96f0039.

This was referenced Jun 10, 2026

slow macOS - "##[error]The job running on agent Azure Pipelines 9 ran longer than the maximum time of 60 minutes." dotnet/dnceng#1883

Open

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

eduardo-vp requested a review from Copilot June 10, 2026 21:01

Copilot started reviewing on behalf of eduardo-vp June 10, 2026 21:01 View session

Copilot AI reviewed Jun 10, 2026

View reviewed changes

Comment thread src/coreclr/vm/genmeth.cpp

Comment thread src/coreclr/vm/genmeth.cpp

Place constraints check outside of the lock

c6eebc9

eduardo-vp changed the title ~~Add size-segregated buckets in UnlockedLoaderHeap~~ Fix LoaderHeap's free list growing more than expected Jun 11, 2026

eduardo-vp requested a review from Copilot June 11, 2026 00:17

Copilot started reviewing on behalf of eduardo-vp June 11, 2026 00:18 View session

Copilot AI reviewed Jun 11, 2026

View reviewed changes

Comment thread src/coreclr/vm/genmeth.cpp Outdated

Comment thread src/coreclr/vm/genmeth.cpp Outdated

jkotas reviewed Jun 11, 2026

View reviewed changes

Comment thread src/coreclr/vm/genmeth.cpp Outdated

Do the constraint check upfront

bf0b174

eduardo-vp requested a review from Copilot June 11, 2026 02:45

Copilot started reviewing on behalf of eduardo-vp June 11, 2026 02:45 View session

Copilot AI reviewed Jun 11, 2026

View reviewed changes

Comment thread src/coreclr/vm/genmeth.cpp Outdated

Comment thread src/coreclr/vm/genmeth.cpp

Potential fix for pull request finding

e557c55

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings June 11, 2026 02:54

Copilot started reviewing on behalf of eduardo-vp June 11, 2026 02:54 View session

Copilot AI reviewed Jun 11, 2026

View reviewed changes

Comment thread src/coreclr/vm/genmeth.cpp

Comment thread src/coreclr/vm/genmeth.cpp Outdated

Comment thread src/coreclr/vm/genmeth.cpp

build-analysis Bot mentioned this pull request Jun 11, 2026

NuGet failing with Response status code does not indicate success: 503 (Service Unavailable) dotnet/arcade#11723

Open

5 tasks

build-analysis Bot mentioned this pull request Jun 11, 2026

[browser][coreCLR] Wasm.Console.Node.Sample - undefined symbol: SystemInteropJS_GetManagedStackTrace #129229

Closed

PR feedback

0fdffb6

jkotas reviewed Jun 15, 2026

View reviewed changes

Comment thread src/coreclr/vm/genmeth.cpp

Call SupressRelease after method desc is registered

ba5d141

Copilot AI review requested due to automatic review settings June 15, 2026 21:39

Copilot started reviewing on behalf of eduardo-vp June 15, 2026 21:39 View session

Copilot AI reviewed Jun 15, 2026

View reviewed changes

Merge branch 'main' into xunit-reg-linux

e6d60d0

Conversation

eduardo-vp commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jkotas commented Jun 10, 2026

Uh oh!

eduardo-vp commented Jun 10, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

jkotas commented Jun 15, 2026

Uh oh!

eduardo-vp commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

eduardo-vp commented Jun 10, 2026 •

edited

Loading