Conversation

@ollie-etl
Contributor

@ollie-etl commented Jan 19, 2023

Extends both registries to accept any iterator of IoBufMut items, instead of only Vec<u8> buffers.

This is motivated by my use case, which cannot safely provide its memory as a Vec and needs custom cleanup. Whilst I can unsafely create one, it would be invalid to call drop on the created Vec.

This implementation explicitly keeps the backing buffers present, rather than mem::forget-ing and recreating them, which means their Drop handling is preserved.

As a trivial example, this would now admit Box<[u8]> as a buffer.

I realize I could provide my own implementation of the pools, but it seems unnecessary: the generalized version serves both purposes. The overhead is an additional allocation on pool creation, although for the common use case this will be optimized away by in-place specialisation.
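As a sketch of what this enables (the generic constructor shape here is my reading of the change, and the Box<[u8]>: IoBufMut impl is assumed, so treat the exact API as illustrative):

use tokio_uring::buf::fixed::FixedBufRegistry;

// Construct a registry from an iterator of boxed slices, with no
// round-trip through Vec<u8>. Assumes Box<[u8]> implements IoBufMut.
fn make_registry() -> FixedBufRegistry<Box<[u8]>> {
    FixedBufRegistry::new(
        std::iter::repeat_with(|| vec![0u8; 4096].into_boxed_slice()).take(8),
    )
}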

@ollie-etl
Contributor Author

@mzabaluev and @FrankReh I'd appreciate your feedback.

@FrankReh
Collaborator

This looks good to me. The way drops are handled now, would it let me carve out the buffers from a single mmap'ed block if I wrote the iterator to unsafely step through the allocated memory? That's been my last sticking point as I think some kernel interfaces are more efficient with page aligned buffers or in some cases, even require page aligned buffers.

@mzabaluev
Contributor

The way drops are handled now, would it let me carve out the buffers from a single mmap'ed block

I think that's what the system allocator does in some cases, or maybe that's only if the allocation sizes are sufficiently big.

some kernel interfaces are more efficient with page aligned buffers or in some cases, even require page aligned buffers.

I believe this does not apply to io-uring: I/O operations and IORING_REGISTER_BUFFERS do not require page alignment. The latter, in fact, locates and maps the pages that contain the buffers, precisely to make operations with fixed buffers maximally efficient.

@FrankReh
Collaborator

I believe this does not apply to io-uring: I/O operations and IORING_REGISTER_BUFFERS do not require page alignment

I beg to differ. (But I'm often wrong.) The tests that come with the liburing C library itself use O_DIRECT in many places, as well as mmap for that very reason.

When I have an app that will use the same buffers countless times for I/O, I will certainly want them page aligned. For small use cases, I won't even go to the trouble of registering the buffers first.
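For reference, an anonymous mapping is one straightforward way to get a page-aligned block in Rust; a minimal sketch, assuming the memmap2 crate (this snippet is illustrative, not from the PR):

use memmap2::MmapMut;

fn main() -> std::io::Result<()> {
    // Anonymous mappings are returned page-aligned by the kernel.
    let map = MmapMut::map_anon(16 * 4096)?;
    assert_eq!(map.as_ptr() as usize % 4096, 0);
    Ok(())
}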

@ollie-etl
Contributor Author

This looks good to me. The way drops are handled now, would it let me carve out the buffers from a single mmap'ed block if I wrote the iterator to unsafely step through the allocated memory? That's been my last sticking point as I think some kernel interfaces are more efficient with page aligned buffers or in some cases, even require page aligned buffers.

@FrankReh Yes: that's what I do with it.

ollie-etl added a commit to ollie-etl/tokio-uring that referenced this pull request Jan 20, 2023
Collaborator

@FrankReh left a comment


I like it.

This and the related PR about supporting a next Future are making this feel like a very tight aspect of the crate.

@mzabaluev Would you be willing to give this a thumbs up?

@ollie-etl
Contributor Author

@FrankReh are all of your concerns addressed?

@ollie-etl
Contributor Author

@mzabaluev, is there anything you would like addressing?

@ollie-etl
Contributor Author

ollie-etl commented Jan 20, 2023

@FrankReh I wanted to reply to this.

This looks good to me. The way drops are handled now, would it let me carve out the buffers from a single mmap'ed block if I wrote the iterator to unsafely step through the allocated memory? That's been my last sticking point as I think some kernel interfaces are more efficient with page aligned buffers or in some cases, even require page aligned buffers.

You can do this in an unsafe way if that is your use case. Your iterator would produce something like Segment, and @mzabaluev is correct that normally you'd pay the (small) Rc penalty. They'd be something like:

struct Segment {
    backing_buffer: Rc<MmapRaw>, // keeps the mapping alive while any segment exists
    ptr: *mut u8,                // start of this segment within the mapping
    init: usize,                 // bytes of this segment known to be initialized
    len: usize,                  // capacity of this segment in bytes
}

However, because you know that if the pool is being dropped none of the segments are in use, and that all of them remain alive until the pool is dropped, you can replace backing_buffer with Option<MmapRaw>, with all but a single Segment being None, removing the Rc overhead. Yuk, I know.
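For concreteness, a sketch of the Rc-counted carving iterator under discussion, reusing the Segment struct above and assuming memmap2's MmapRaw for the mapping (the IoBufMut impl for Segment is omitted):

use std::rc::Rc;
use memmap2::MmapRaw;

// Carve `count` fixed-size segments out of one mapping. Each segment
// keeps the mapping alive through its Rc clone.
fn carve(map: MmapRaw, seg_len: usize, count: usize) -> impl Iterator<Item = Segment> {
    assert!(seg_len * count <= map.len());
    let map = Rc::new(map);
    (0..count).map(move |i| Segment {
        backing_buffer: Rc::clone(&map),
        // SAFETY: the assert above keeps every offset inside the mapping.
        ptr: unsafe { map.as_mut_ptr().add(i * seg_len) },
        init: 0,
        len: seg_len,
    })
}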

@FrankReh
Collaborator

You can do this in an unsafe way if that is your use case. Your iterator would produce something like Segment, and @mzabaluev is correct that normally you'd pay the (small) Rc penalty.

Thanks. I don't mind tapping into unsafe. I had hoped the iterator could be implemented programmatically, but really just to keep from polluting the heap with unnecessary stuff - not to save space or startup/tear-down time really. For me, setting up a pool would happen once at app or thread startup, and then the buffers get reused for hours or days before a reboot anyway.

Also for pools that I want to use, I don't see that tracking init per buf is important. If the compiler lets me, I would avoid tracking init and simply zero all the memory ahead of time.

Collaborator

@FrankReh left a comment


Looks good to me. I'll plan on merging tomorrow to give others time to still comment. Thanks!

@mzabaluev
Contributor

mzabaluev commented Jan 21, 2023

I still think this is the wrong place to extend, because it's an imperfect fit for either of the discussed use cases:

  1. It introduces redundant bookkeeping in the Vec<u8> case.
  2. It forces the "segments out of a large mmap" case into extra refcounting in the buffer handles.

I like generics, but not when they de-optimize their most used instantiations. And we already have an extension point in the FixedBuffers vtable, which has a runtime cost. So we might as well amortize the flexibility into that.

Also for pools that I want to use, I don't see that tracking init per buf is important. If the compiler lets me, I would avoid tracking init and simply zero all the memory ahead of time.

While this is a special case where it may not be important (I think zero-initializing is better than admitting UB into your program, though), the proposed change introduces a generic implementation that should work correctly for all buffer types.
Hence I'm suggesting addressing specialized cases at the FixedBuffers level, which the current design already allows (pending making the trait public).
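For readers without the source open, the extension point referred to is roughly the following shape; the trait is internal to tokio-uring and the method names below are illustrative, not the crate's exact definitions:

// Approximate shape of the internal FixedBuffers trait; illustrative only.
trait FixedBuffers {
    // The iovec table handed to IORING_REGISTER_BUFFERS.
    fn iovecs(&self) -> &[libc::iovec];
    // Record how many bytes of a checked-out buffer were initialized
    // when it is returned to the pool.
    fn check_in(&mut self, index: u16, init_len: usize);
}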

@ollie-etl
Contributor Author

ollie-etl commented Jan 21, 2023

  1. It introduces redundant bookkeeping in the Vec case.

I really don't follow this logic, sorry. I see no extra overhead.

@mzabaluev
Contributor

When I have an app that will use the same buffers countless times for I/O, I will certainly want them page aligned.

I believe you get fairly aligned buffers with something like iter::repeat_with(|| Vec::with_capacity(16_384)).take(N).
Also, I've learned that any speculative talk about performance is empty until demonstrated with an actual benchmark.
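In that spirit, the alignment claim at least is cheap to check empirically; a standalone sketch:

fn main() {
    // Large allocations often come back page-aligned from the system
    // allocator; print each buffer's address and whether it lands on a
    // 4096-byte boundary.
    let bufs: Vec<Vec<u8>> = std::iter::repeat_with(|| Vec::with_capacity(16_384))
        .take(8)
        .collect();
    for b in &bufs {
        println!("{:p} page-aligned: {}", b.as_ptr(), b.as_ptr() as usize % 4096 == 0);
    }
}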

@mzabaluev
Contributor

mzabaluev commented Jan 21, 2023

  1. It introduces redundant bookkeeping in the Vec case.

I really don't follow this logic, sorry. I see no extra overhead.

A member of the buffers array has exactly the same information as the corresponding iovec and the init_len member of the BufState::Free struct. Furthermore, it's currently allowed to go out of sync with the init_len value that is updated on buffer check-ins, which technically results in UB when the buffers' Vec<u8> instances are dropped. That is probably OK for this specific case, but not generically.
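Paraphrasing the registry internals under discussion (not the exact code), the per-buffer state is along these lines:

// Approximate shape of the per-buffer state; the real tokio-uring
// definition may differ.
enum BufState {
    // The buffer is available; `init_len` bytes are known to be initialized.
    Free { init_len: usize },
    // The buffer is checked out by an in-flight operation.
    CheckedOut,
}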

@ollie-etl
Contributor Author

ollie-etl commented Jan 21, 2023

A member of the buffers array has exactly the same information as the corresponding iovec and the init_len member of the BufState::Free struct.

Yes. But:

  1. no more memory is allocated; the buffers are present but "forgotten" in the current implementation.
  2. no extra accounting is done on those buffers (until registry drop).

Are you objecting to the presence of the stored Vec (pointer, length, and capacity)?

Furthermore, it's currently allowed to go out of sync with the init_len value that is updated on buffer check-ins, which technically results in UB when the buffers' Vec instances are dropped. That is probably OK for this specific case, but not generically.

It's not UB. However, I did update the drop implementation to update the buffers on drop, in case they are some Rc-backed implementation which takes them elsewhere and you want to know how many bytes currently rest in them. I'd be very upset if the compiler didn't strip that completely in the case where you do just drop, so I don't believe this is an overhead.

@ollie-etl
Contributor Author

ollie-etl commented Jan 21, 2023

Hmm, that last change does appear to have broken a test - will check.

Fixed. I think the original had a bug which has just been exposed.

@ollie-etl
Contributor Author

@FrankReh You'll want to review the changes here also. I don't think they change the PR much.

@FrankReh
Collaborator

I have to wait until much later today to look. Family dentistry emergency.

I like the discussion, when it's not personal. It seems the code is getting better as the two main protagonists here are talking.

I'll remind everyone the whole idea behind supporting uring is to get at performance gains between the userland and the kernel/hardware and we generally do it in good faith, without benchmarks to prove that we are on the right track. And my opinion is we try to present the most generic approach we can that fits into the boundaries of the tokio ecosystem. If things work out of the box, that makes for a nice user experience.

(I hope my opinions aren't used against me.)

Thanks. (And a reminder for everyone to take care of their teeth while they can.)

@ollie-etl
Contributor Author

ollie-etl commented Jan 21, 2023

I like the discussion, when it's not personal. It seems the code is getting better as the two main protagonists here are talking.

Indeed. Whilst it's true I can find getting things reviewed in this particular repository frustrating, I'll have to admit the review has always resulted in better code. As for contributors, I've nothing but thanks for people who give their (or their companies') time to improving open-source code. I'll note that a lot of that time seems to occur here out of hours.

Collaborator

@FrankReh left a comment


It's looking good. A lot of brain cycles have been put into this little corner of the crate.

Collaborator

@FrankReh left a comment


One unresolved comment left I think.

@ollie-etl
Contributor Author

ollie-etl commented Jan 23, 2023

All comments addressed! LGTM?

Collaborator

@FrankReh left a comment


Well done sir!

@FrankReh merged commit 0acb432 into tokio-rs:master Jan 24, 2023