
Conversation

@dmitry-tarnyagin
Contributor

Hei,

This PR introduces two optimizations aimed at boosting TLS uplink performance:

  • Key caching
    Adds optional caching for expanded session keys and IVs, eliminating costly re-expansion for every packet.

  • Configurable flush policy
    Introduces a FlushPolicy to control when the TLS layer flushes its transport.
    In Relaxed mode, it skips per-record flushes, removing TCP ACK delays and allowing TCP-level buffering to work.

Regards,
Dmitry

Cache expanded session keys that are used to encrypt and decrypt traffic.

This change is potentially unsafe and is enabled only when the "key-cache"
feature flag is set.
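The caching idea can be sketched as follows. This is an illustrative model, not the crate's actual types: `KeySchedule`, `CachedKeys`, and `expand` are hypothetical names, and the expansion step stands in for the real HKDF-Expand-Label computation.

```rust
// Illustrative sketch of the key-cache idea; names are hypothetical and
// expand() stands in for the real HKDF-Expand-Label derivation.
struct CachedKeys {
    key: [u8; 16],
    iv: [u8; 12],
}

struct KeySchedule {
    traffic_secret: [u8; 32],
    cache: Option<CachedKeys>,
    expansions: usize, // counts how many times the expensive expansion ran
}

impl KeySchedule {
    // Stand-in for HKDF-Expand-Label: derives key material from the secret.
    fn expand(&mut self) -> CachedKeys {
        self.expansions += 1;
        let mut key = [0u8; 16];
        let mut iv = [0u8; 12];
        key.copy_from_slice(&self.traffic_secret[..16]);
        iv.copy_from_slice(&self.traffic_secret[..12]);
        CachedKeys { key, iv }
    }

    // Per-record key access: without the cache every record re-expands;
    // with it, only the first access pays the derivation cost.
    fn keys(&mut self) -> &CachedKeys {
        if self.cache.is_none() {
            let derived = self.expand();
            self.cache = Some(derived);
        }
        self.cache.as_ref().unwrap()
    }
}

fn main() {
    let mut ks = KeySchedule {
        traffic_secret: [7u8; 32],
        cache: None,
        expansions: 0,
    };
    for _ in 0..100 {
        let k = ks.keys(); // one access per encrypted record
        assert_eq!(k.key.len(), 16);
        assert_eq!(k.iv.len(), 12);
    }
    assert_eq!(ks.expansions, 1); // derived once, not 100 times
    println!("ok");
}
```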

Introduce a configurable FlushPolicy to control when the TLS layer flushes its
underlying transport.

This change adds a FlushPolicy enum (Relaxed or Strict) and wires it into TlsConnection
and TlsWriter. The policy determines whether the transport’s flush() is called after
writing a TLS record.

With Strict (the default), the transport is always flushed, ensuring that data is acknowledged
or committed before continuing. This mode is compatible with existing behavior.

With Relaxed, the TLS layer closes the record and hands bytes to the transport without
forcing a flush, allowing buffered writes to improve performance and reduce latency.

This gives callers explicit control over how aggressively the transport flushes data.
It's particularly important for transports like embassy-net TCP, where flush() blocks
waiting for ACKs. The default remains Strict to preserve compatibility with embedded-tls 0.17.0,
while the Relaxed mode can be selected together with a larger socket window for improved throughput.
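The Strict vs. Relaxed write path can be modeled with a minimal sketch. The types below are simplified stand-ins, not the actual TlsWriter or transport interfaces from the crate:

```rust
// Simplified model of the flush policy: Strict flushes after every record,
// Relaxed leaves buffering to the transport. Types are illustrative only.
#[derive(Clone, Copy, PartialEq)]
enum FlushPolicy {
    Strict,
    Relaxed,
}

// A toy transport that counts writes and flushes.
struct Transport {
    written: usize,
    flushed: usize,
}

impl Transport {
    fn write(&mut self, buf: &[u8]) {
        self.written += buf.len();
    }
    fn flush(&mut self) {
        self.flushed += 1;
    }
}

struct TlsWriter {
    policy: FlushPolicy,
    transport: Transport,
}

impl TlsWriter {
    fn write_record(&mut self, record: &[u8]) {
        self.transport.write(record);
        // Strict: force the bytes out after each record (existing behavior).
        // Relaxed: let the transport (e.g. the TCP stack) decide when to send.
        if self.policy == FlushPolicy::Strict {
            self.transport.flush();
        }
    }
}

fn main() {
    let mut strict = TlsWriter {
        policy: FlushPolicy::Strict,
        transport: Transport { written: 0, flushed: 0 },
    };
    let mut relaxed = TlsWriter {
        policy: FlushPolicy::Relaxed,
        transport: Transport { written: 0, flushed: 0 },
    };
    for _ in 0..3 {
        strict.write_record(&[0u8; 16]);
        relaxed.write_record(&[0u8; 16]);
    }
    assert_eq!(strict.transport.flushed, 3); // one flush per record
    assert_eq!(relaxed.transport.flushed, 0); // no per-record flushes
    println!("ok");
}
```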
@lulf
Member

lulf commented Oct 24, 2025

Hei,

> This PR introduces two optimizations aimed at boosting TLS uplink performance:
>
> * Key caching
>   Adds optional caching for expanded session keys and IVs, eliminating costly re-expansion for every packet.
>
> * Configurable flush policy
>   Introduces a FlushPolicy to control when the TLS layer flushes its transport.
>   In Relaxed mode, it skips per-record flushes, removing TCP ACK delays and allowing TCP-level buffering to work.

I'm wondering, could we just not do per-record flushes and rely on TCP buffering? I've never tried that, so it's more of a question if you've tried it with embassy-net.

@dmitry-tarnyagin
Contributor Author

dmitry-tarnyagin commented Oct 24, 2025

> I'm wondering, could we just not do per-record flushes and rely on TCP buffering? I've never tried that, so it's more of a question if you've tried it with embassy-net.

It depends. With embassy-net as a transport you should not do flushes, as it's the stack's responsibility to send data (buffer, handle retransmits, etc.). But if you for some reason are tunneling embedded-tls over embedded-tls (why not? :)), you'll face a problem.

At the same time it's ok to flush handshakes and other request-response transactions (if any), where you have to wait for a reply anyway.

@lulf
Member

lulf commented Oct 24, 2025

I don't see any reason to explicitly flush unless there are some cases I'm missing. Thoughts, @rmja, @newAM? I'd rather just change it to not explicitly flush and avoid having the flush strategy.

@rmja
Member

rmja commented Oct 24, 2025

I guess the expectation for flush() is that it propagates all the way through the stack, and that any buffering that has occurred in an intermediate layer is delegated to the next. If some layer below embedded-tls buffers in any way, then it will not receive the flush() if we hijack it. For that reason I think something like "strict flush" is good and should be the default (like in the PR).

That being said, I can understand why you would want something like this, when you are able to control flush behavior on the below layer explicitly.
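The layering concern above can be made concrete with a toy example: if an intermediate transport only drains its buffer on flush(), then a TLS layer that swallows flush() leaves bytes stranded in that buffer. The types below are illustrative, not real embedded-io or embassy-net types:

```rust
// Toy model of a buffering layer below TLS: write() only buffers, and the
// buffered bytes reach the wire only when flush() propagates down to it.
struct Wire {
    sent: Vec<u8>,
}

struct BufferedTransport {
    buf: Vec<u8>,
    wire: Wire,
}

impl BufferedTransport {
    fn write(&mut self, data: &[u8]) {
        self.buf.extend_from_slice(data);
    }
    fn flush(&mut self) {
        // Delegate buffered bytes to the next layer, as a real flush would.
        self.wire.sent.extend_from_slice(&self.buf);
        self.buf.clear();
    }
}

fn main() {
    let mut t = BufferedTransport {
        buf: Vec::new(),
        wire: Wire { sent: Vec::new() },
    };
    t.write(b"tls record");
    // If the TLS layer never forwards flush(), nothing reaches the wire:
    assert!(t.wire.sent.is_empty());
    t.flush();
    assert_eq!(t.wire.sent, b"tls record");
    println!("ok");
}
```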

@newAM
Collaborator

newAM commented Oct 25, 2025

No opinions on the flushing. I'm not familiar enough with the underlying transports to have a solid mental model of the consequences of the flushing change.


The key cache seems logical, but is there a reason to hide it behind a feature? The memory cost of storing an IV and Key is negligible compared to the overall footprint of embedded-tls. I would prefer this to be enabled by default without a feature to disable it. Easier to maintain.

@dmitry-tarnyagin
Contributor Author

dmitry-tarnyagin commented Oct 25, 2025

@newAM

> is there a reason to hide it behind a feature?

I'm not entirely sure. On one hand, it's perfectly fine for me to cache them and even derive them immediately in calculate_traffic_secret(). On the other hand, keeping ready-to-use keys in readable memory sounds a bit risky (yes, I know the key material is already readable). I'm not a security expert, so the feature flag allows others to decide.

@newAM
Collaborator

newAM commented Oct 26, 2025

> > is there a reason to hide it behind a feature?
>
> I'm not entirely sure. On one hand, it's perfectly fine for me to cache them and even derive them immediately in calculate_traffic_secret(). On the other hand, keeping ready-to-use keys in readable memory sounds a bit risky (yes, I know the key material is already readable). I'm not a security expert, so the feature flag allows others to decide.

When it comes to features for the purposes of security my method is to describe a threat model and then evaluate if the feature is sufficient to mitigate that threat. I can't think of a threat model where a motivated attacker obtains arbitrary memory access then is stopped by key derivation, which is why I suggested this is enabled by default.

@dmitry-tarnyagin
Contributor Author

@newAM That's perfectly fine for me, I will update it w/o a feature. Thanks for the review!

@dmitry-tarnyagin force-pushed the feature/key-cache-n-flush-policy branch from 09d165e to ab0c892 on October 28, 2025 14:21
@dmitry-tarnyagin
Contributor Author

@lulf Updated, ready for review.

@lulf lulf merged commit dccd966 into drogue-iot:main Oct 28, 2025
7 checks passed