fix(wit-bindgen-wrpc): clear out variant decoder internal state after use #257

vados-cosmonic · 2024-08-21T12:26:36Z

This commit fixes a bug which manifests itself when decoding a list of enums that may have separate internal decoder objects to use.

The issue is that storing the internal state inside the bindgen'd Decoder becomes a problem on the second (or subsequent) elements of a list of the given enum.

Given a vec![SomeEnum::X, SomeEnum::Y] the saved "state" for the first becomes the first decoder (i.e. the decoder for SomeEnum::X). When it comes time to parse the discriminant and bytes for Y the existence of the state causes a short circuit.

As the (wrong) decoder will generally try to process some bytes, you lose bytes from the input and then bytes are left over at the next attempt to decode (and then an error occurs). How this manifests is obviously specific to the enum, (wrong) decoder, and involved variants, how they are decoded -- i.e. where the decoding will fail.

This only happens for generating bindings for variant types (i.e. Rust enums), which use the PayloadDecoder machinery to dynamically pick which payload decoder to use, depending on the discriminant read.

Signed-off-by: Roman Volosatovs <[email protected]>

This commit fixes a bug which manifests itself when decoding a list of enums that may have separate internal decoder objects to use. The issue is that storing the internal state inside the bindgen'd `Decoder` becomes a problem on the *second* (or subsequent) elements of a list of the given enum. Given a `vec![SomeEnum::X, SomeEnum::Y]` the saved "state" for the first becomes the first decoder (i.e. the decoder for `SomeEnum::X`). When it comes time to parse the discriminant and bytes for `Y` the existence of the `state` causes a short circuit. As the (wrong) decoder will generally try to process some bytes, you lose bytes from the input and then bytes are left over at the next attempt to decode (and then an error occurs). How this manifests is obviously specific to the enum, (wrong) decoder, and involved variants, how they are decoded -- i.e. where the decoding will fail. This only happens for generating bindings for variant types (i.e. Rust enums), which use the `PayloadDecoder` machinery to dynamically pick *which* payload decoder to use, depending on the discriminant read. Signed-off-by: Victor Adossi <[email protected]>

rvolosatovs

Thanks!
I can confirm that this fixes the sync case, however this would break the async functionality (i.e. nested future or stream values in variants). I'll push a test case and a fix for the latter to this branch

vados-cosmonic · 2024-08-21T13:26:46Z

Ah, so that's what the use case there was for -- to try to reuse an existing async decoder? Thanks for taking a look!

Signed-off-by: Roman Volosatovs <[email protected]>

rvolosatovs · 2024-08-21T13:47:11Z

Ah, so that's what the use case there was for -- to try to reuse an existing async decoder? Thanks for taking a look!

the decoders are generally meant to be reused indeed, which is why resetting their state is important once decoding succeeds (as was fixed in your commit)

Tokio interface is not sufficient for our use case, however, since we may also require deferred decoding of an async value after the synchronous part is done, which is what the Deferred trait handles. The deferred function (if any) must be preserved from the decoder state, which is what I've addressed in 4314bb2

vados-cosmonic requested a review from rvolosatovs as a code owner August 21, 2024 12:26

vados-cosmonic changed the title ~~fix(wit-bindgen-wrpc): clear out internal state after use~~ fix(wit-bindgen-wrpc): clear out variant decoder internal state after use Aug 21, 2024

rvolosatovs and others added 2 commits August 21, 2024 14:38

test: add variant decoding reproducer

4e19cee

Signed-off-by: Roman Volosatovs <[email protected]>

rvolosatovs reviewed Aug 21, 2024

View reviewed changes

feat(rs-bindgen): reset variant payload decoder state

4314bb2

Signed-off-by: Roman Volosatovs <[email protected]>

rvolosatovs force-pushed the fix(wit-bindgen-wrpc)=reuse-of-state-broken-list-of-enums branch from 8a433e3 to 4314bb2 Compare August 21, 2024 13:43

rvolosatovs approved these changes Aug 21, 2024

View reviewed changes

rvolosatovs enabled auto-merge August 21, 2024 13:56

rvolosatovs added this pull request to the merge queue Aug 21, 2024

Merged via the queue into bytecodealliance:main with commit 1a19079 Aug 21, 2024
25 checks passed

vados-cosmonic deleted the fix(wit-bindgen-wrpc)=reuse-of-state-broken-list-of-enums branch August 21, 2024 15:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(wit-bindgen-wrpc): clear out variant decoder internal state after use #257

fix(wit-bindgen-wrpc): clear out variant decoder internal state after use #257

Uh oh!

vados-cosmonic commented Aug 21, 2024

Uh oh!

rvolosatovs left a comment

Uh oh!

vados-cosmonic commented Aug 21, 2024

Uh oh!

rvolosatovs commented Aug 21, 2024

Uh oh!

Uh oh!

Uh oh!

fix(wit-bindgen-wrpc): clear out variant decoder internal state after use #257

fix(wit-bindgen-wrpc): clear out variant decoder internal state after use #257

Uh oh!

Conversation

vados-cosmonic commented Aug 21, 2024

Uh oh!

rvolosatovs left a comment

Choose a reason for hiding this comment

Uh oh!

vados-cosmonic commented Aug 21, 2024

Uh oh!

rvolosatovs commented Aug 21, 2024

Uh oh!

Uh oh!

Uh oh!