Conversation

@kripken kripken commented Mar 18, 2024

E.g. loading 4 bytes from 2^32 - 2 should error: 2 bytes are past the maximum
address. Before this PR we added 2^32 - 2 + 4 and overflowed to 2, which we
saw as a low and safe address. This PR adds an extra check for an overflow in
that add.

Fixes emscripten-core/emscripten#21557

@kripken kripken requested a review from sbc100 March 18, 2024 23:37

kripken commented Mar 18, 2024

(Testing this here is not easy, as we don't have a full emscripten output with sbrk_ptr support etc. But we can add an Emscripten test.)

@sbc100 sbc100 left a comment

Looks like a lot of extra generated code. Do you think it's worth it?

Also, it's not obvious to me how those tests got updated. Presumably you didn't update them by hand. Can you mention the command you used to update them?


kripken commented Mar 18, 2024

I think this is worth fixing. Safe-heap builds can afford to be a little larger and slower, even if this particular corner case is very rare.

The test update here is 100% automatic. See the last section of the README's testing notes; specifically, I ran ./scripts/auto_update_tests.py wasm-opt.


sbc100 commented Mar 19, 2024

> I think this is worth fixing. Safe-heap builds can afford to be a little larger and slower, even if this particular corner case is very rare.

Sure, but isn't this change almost doubling the overhead for the sake of a fairly rare case? Maybe it's not rare? Or maybe I'm not understanding the proportional impact of this change?


kripken commented Mar 19, 2024

The overhead is significant I'm guessing, but not double. A typical safe-heap method here is 18 instructions, and this adds 5. But much of the overhead is likely in the call to the safe-heap method anyhow. Still, maybe it's worth measuring this, as e.g. if this prevents inlining of these methods (assuming they were inlined before) then maybe it is significant actually. I can do some measurements.


sbc100 commented Mar 19, 2024

> The overhead is significant I'm guessing, but not double. A typical safe-heap method here is 18 instructions, and this adds 5. But much of the overhead is likely in the call to the safe-heap method anyhow. Still, maybe it's worth measuring this, as e.g. if this prevents inlining of these methods (assuming they were inlined before) then maybe it is significant actually. I can do some measurements.

No worries, that sounds reasonable to me. If folks complain that it's prohibitive we can always reassess.


kripken commented Mar 19, 2024

Measuring this, these functions were already too big to get inlined, so that didn't change. On fannkuch I see a negligible code size change (much less than 1%), though speed does decrease by 5%. So this is a regression, but in this type of build I think that's ok given that it improves the correctness of what we are checking, just as more checks make UBSan builds slower.

If that 5% mattered, we could actually optimize this. At the moment we emit

if (overflow) segfault();
if (out of bounds) segfault();

And we really just need one if here. Fixing that would require some refactoring here.

I did see that changing to

if (overflow) { segfault(); unreachable } // add an unreachable here
if (out of bounds) segfault();

removes the regression here on speed. Adding that unreachable byte seems to help the VM, perhaps by telling it no control flow merges happen later. Anyhow, it seems we could easily speed this up with some work, but not sure if it's worthwhile atm.


@sbc100 sbc100 left a comment

lgtm either way


kripken commented Mar 19, 2024

This needed some refactoring anyhow: I had only put the check in the loads, but the stores need it too. Adding it to stores raised the overhead a bit, to 6.5%.

I added those unreachables and the overhead is down to 1.5%, which is likely not noticeable; the unreachable seems useful anyhow. These are basically noreturn calls, which clang would append an unreachable after as well.


sbc100 commented Jul 12, 2024

Should we land this?


kripken commented Jul 12, 2024

I'm not sure. 6.5% overhead is maybe not a big deal in a slow build anyhow, but it is for quite small gain. What do you think?


sbc100 commented Jul 12, 2024

I don't have a strong opinion... just trying to close some issues :)


kripken commented Jul 12, 2024

Ok, let's land it. The point of SAFE_HEAP is to catch memory bugs, and this situation is a memory bug.

@kripken kripken merged commit 0e0e08d into WebAssembly:main Jul 12, 2024
@kripken kripken deleted the safe-heap.overflow branch July 12, 2024 20:37
@gkdn gkdn mentioned this pull request Aug 31, 2024

Development

Successfully merging this pull request may close these issues.

SAFE_HEAP should default maximum memory to 1 block less than 2GB
