
Conversation

@rjl493456442 (Member) commented on Apr 4, 2025

This pull request enhances the block prefetcher by executing transactions in parallel
to warm the cache alongside the main block processor.

Unlike the original prefetcher, which only executes the next block and is limited to chain
syncing, the new implementation can be applied to any block. This makes it useful not
only during chain sync but also for regular block insertion after the initial sync.
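At a high level, the mechanism looks like the sketch below. This is not the PR's actual code: warmBlock and executeForWarmup are illustrative stand-ins for running each transaction through the EVM, and the real implementation caps the number of concurrent workers (see the review discussion below).

import (
    "sync"

    "github.com/ethereum/go-ethereum/core/state"
    "github.com/ethereum/go-ethereum/core/types"
)

// Execute every transaction against a throwaway copy of the parent state,
// purely for the account/storage/code reads it triggers. The results are
// discarded; only the warmed caches matter.
func warmBlock(block *types.Block, statedb *state.StateDB, signer types.Signer) {
    var wg sync.WaitGroup
    for _, tx := range block.Transactions() {
        wg.Add(1)
        go func(tx *types.Transaction) {
            defer wg.Done()
            // A private state copy per transaction avoids racing with the
            // main processor and with the other warm-up goroutines.
            executeForWarmup(statedb.Copy(), tx, signer)
        }(tx)
    }
    wg.Wait()
}

// executeForWarmup stands in for actually running the EVM; even just
// touching the sender and destination pulls the hot state into the caches.
func executeForWarmup(st *state.StateDB, tx *types.Transaction, signer types.Signer) {
    if from, err := types.Sender(signer, tx); err == nil {
        st.GetBalance(from)
    }
    if to := tx.To(); to != nil {
        st.GetCode(*to)
    }
}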

TODO

  • Experiment with whether state hashing is necessary in the block prefetcher (it may duplicate the work of the trie prefetcher)

@rjl493456442 requested a review from holiman as a code owner on April 4, 2025 05:21
@rjl493456442 (Member, Author):

PR: bench05
Master: bench06

  • The PR is about 10% faster than master.
  • The speedup comes from faster account/storage reads.
  • Memory allocation and CPU usage are about 2× those of master.
[Six benchmark screenshots, captured 2025-04-04, omitted]
File: geth
Type: inuse_space
Time: 2025-03-31 19:57:06 CST
Entering interactive mode (type "help" for commands, "o" for options)
(pprof) alloc_space
(pprof) top
Showing nodes accounting for 3139.56GB, 49.79% of 6305.50GB total
Dropped 2237 nodes (cum <= 31.53GB)
Showing top 10 nodes out of 245
      flat  flat%   sum%        cum   cum%
  886.65GB 14.06% 14.06%  1019.34GB 16.17%  github.com/ethereum/go-ethereum/trie.(*hasher).hashFullNodeChildren
  472.06GB  7.49% 21.55%   935.83GB 14.84%  github.com/ethereum/go-ethereum/trie.decodeFull
  463.89GB  7.36% 28.90%   463.89GB  7.36%  github.com/ethereum/go-ethereum/trie.decodeRef
     320GB  5.07% 33.98%      320GB  5.07%  github.com/ethereum/go-ethereum/core/vm.(*Memory).Resize
  298.84GB  4.74% 38.72%   298.84GB  4.74%  github.com/ethereum/go-ethereum/rlp.(*encBuffer).makeBytes
  227.90GB  3.61% 42.33%   227.90GB  3.61%  github.com/ethereum/go-ethereum/trie.(*tracer).onRead
  147.57GB  2.34% 44.67%   147.57GB  2.34%  github.com/ethereum/go-ethereum/common.RightPadBytes
  109.50GB  1.74% 46.41%   110.78GB  1.76%  github.com/ethereum/go-ethereum/rlp.(*encBuffer).writeBytes
  106.61GB  1.69% 48.10%   106.61GB  1.69%  golang.org/x/crypto/sha3.NewLegacyKeccak256
  106.55GB  1.69% 49.79%   106.55GB  1.69%  github.com/ethereum/go-ethereum/core/vm.codeBitmap

The main memory allocators are the trie loader and the trie hasher.

@rjl493456442 force-pushed the in-block-cachewarmmer branch 2 times, most recently from f4f1f5a to ce318a3 on April 6, 2025 11:55
@rjl493456442 force-pushed the in-block-cachewarmmer branch 2 times, most recently from ebec558 to 5abc763 on April 28, 2025 06:57
@rjl493456442 force-pushed the in-block-cachewarmmer branch 2 times, most recently from 8ef7604 to 271503f on May 5, 2025 02:41
@rjl493456442 (Member, Author):

@MariusVanDerWijden @fjl Please take a look. This PR is ready for review.

@rjl493456442 added this to the 1.15.12 milestone on May 5, 2025
return nil
}
// Preload the touched accounts and storage slots in advance
sender, err := types.Sender(signer, tx)
Review comment (Member):

Can this realistically fail? Only if the block is at a fork boundary and the signer changes, or if the signature is invalid, right? Shouldn't we just exit here, and otherwise always warm the sender?
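A minimal sketch of that suggestion, reusing the names from the hunk above (the GetBalance call is just an illustrative way to warm the account):

sender, err := types.Sender(signer, tx)
if err != nil {
    // Invalid signature, or the signer changed at a fork boundary: the
    // main processor will reject the block anyway, so stop prefetching.
    return nil
}
statedb.GetBalance(sender) // always warm the sender account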

statedb.IntermediateRoot(true)
// Preload the contract code if the destination has non-empty code
if account != nil && !bytes.Equal(account.CodeHash, types.EmptyCodeHash.Bytes()) {
reader.Code(*tx.To(), common.BytesToHash(account.CodeHash))
Review comment (Member):

Is this faster than blindly loading the code?

Review comment (Member):

Should we also follow 7702 delegations here already?

Reply from @rjl493456442 (Member, Author), May 8, 2025:

> Is this faster than blindly loading the code?

Not sure, but it's cheap anyway.
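On the 7702 question above: following a delegation during warm-up could look roughly like the sketch below. It assumes the reader also exposes an Account lookup alongside Code, and uses types.ParseDelegation to detect the 0xef0100 ‖ address delegation designator; none of this is the PR's actual code.

code, _ := reader.Code(*tx.To(), common.BytesToHash(account.CodeHash))
if target, ok := types.ParseDelegation(code); ok {
    // The destination is an EIP-7702 delegated account: warm the
    // delegate's account and code as well.
    if delegated, _ := reader.Account(target); delegated != nil {
        reader.Code(target, common.BytesToHash(delegated.CodeHash))
    }
}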

@rjl493456442 force-pushed the in-block-cachewarmmer branch from 9260d8e to 8be2f84 on May 8, 2025 02:15
// This operation incurs significant memory allocations due to
// trie hashing and node decoding. TODO(rjl493456442): investigate
// ways to mitigate this overhead.
stateCpy.IntermediateRoot(true)
Review comment (Member):

We're only checking the interrupt at the beginning of the call. That was fine previously, when we executed the transactions linearly, but now the interrupt will most likely not stop any work from being done, since all goroutines are likely to be past the entry point. I'm wondering whether it would make sense to start a second goroutine that does something like this:

go func(evm *vm.EVM, interrupt *atomic.Bool) {
    for {
        time.Sleep(time.Millisecond)
        if interrupt != nil && interrupt.Load() {
            evm.Cancel() // abort any in-flight EVM execution
            return
        }
    }
}(evm, interrupt)

(or something similar, you get the gist)

Reply from @rjl493456442 (Member, Author):

Not really. We limit the parallelism to runtime.NumCPU() / 2 workers. With 16 available CPU cores, only 8 goroutines are created, and transactions are assigned to these workers in order.

If the prefetching is terminated, there is still a very good chance of stopping or preventing the subsequent transaction executions, as sketched below.
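Concretely, the dispatch pattern could look like this (a sketch under assumed names: applyTx stands in for the per-transaction execution, and the channel-based fan-out is illustrative rather than the PR's exact code):

import (
    "runtime"
    "sync"
    "sync/atomic"

    "github.com/ethereum/go-ethereum/core/types"
)

func warm(txs []*types.Transaction, interrupt *atomic.Bool) {
    workers := max(1, runtime.NumCPU()/2)
    txCh := make(chan *types.Transaction, len(txs)) // buffered, so feeding never blocks
    var wg sync.WaitGroup
    for i := 0; i < workers; i++ {
        wg.Add(1)
        go func() {
            defer wg.Done()
            for tx := range txCh {
                // Re-check the interrupt before each transaction: once the
                // prefetch is terminated, the remaining transactions are
                // skipped even if in-flight ones run to completion.
                if interrupt != nil && interrupt.Load() {
                    return
                }
                applyTx(tx) // hypothetical per-transaction execution
            }
        }()
    }
    for _, tx := range txs {
        txCh <- tx
    }
    close(txCh)
    wg.Wait()
}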

Review comment (Member):

Ah yeah, I missed that. Makes sense.

@MariusVanDerWijden (Member):

Allocations are really a bit crazy :D, going up to 500 MB/s. Just added two nitpicks; otherwise this looks good to me.
As discussed on stabby, we should merge and fix up the allocation bit later.

@MariusVanDerWijden (Member) left a review:

LGTM

@rjl493456442 merged commit 485ff4b into ethereum:master on May 8, 2025 (3 of 4 checks passed)
howjmay pushed a commit to iotaledger/go-ethereum that referenced this pull request on Aug 27, 2025
gballet pushed a commit to gballet/go-ethereum that referenced this pull request on Sep 11, 2025