Add streaming support to OuteTTS #169

lucasnewman · 2025-05-19T23:29:36Z

This allows reducing time-to-first-speech for OuteTTS, which can play in realtime using the 8-bit quantized model on my M3 Max.

mlx_lm.generate --model mlx-community/Llama-3.2-1B-Instruct-4bit --verbose false --temp 0 --max-tokens 512 --prompt "Write one sentence on wavelets." |\
python -m mlx_audio.tts.generate --model mlx-community/OuteTTS-1.0-0.6B-8bit --stream --verbose

lin72h · 2025-05-19T23:42:29Z

@lucasnewman Thanks for perfecting this, it's very usable

Blaizzy · 2025-05-19T23:59:43Z

Thanks @lucasnewman, this is awesome!

Could you try and add some overlap similar to the demo implementation we came up with a month ago?

I think it will help with the cuts and slight noise when going over disjoint segments. Also, not sure but the audio player might need an update to play with overlaps.

https://gist.github.com/Blaizzy/c18a52509fec3cbc5cbd64b07e33ca1c

lucasnewman · 2025-05-20T00:36:52Z

> Thanks @lucasnewman, this is awesome!
>
> Could you try and add some overlap similar to the demo implementation we came up with a month ago?
>
> I think it will help with the cuts and slight noise when going over disjoint segments. Also, not sure but the audio player might need an update to play with overlaps.
>
> https://gist.github.com/Blaizzy/c18a52509fec3cbc5cbd64b07e33ca1c

It decodes the entire segment for each chunk, so it's technically like 100% overlap. I compiled the decoder loop to mitigate the performance hit of the full decode for that reason, as Descript doesn't really support "streaming" chunk-wise. If the audio player is running dry, try turning up the streaming interval. It's device-specific and we're not doing any ahead-of-time buffering, so if generation is slower than realtime there will always be gaps.
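To illustrate the full re-decode approach described above, here is a minimal sketch. It is not the code from this PR. The function name `stream_full_redecode`, the `decode` callback, the toy decoder, and the choice to emit only the samples past the previously emitted position are all assumptions made for the example.

```python
from typing import Callable, Iterable, Iterator, List


def stream_full_redecode(
    tokens: Iterable[int],
    decode: Callable[[List[int]], List[float]],
    interval: int,
) -> Iterator[List[float]]:
    """Re-decode all tokens seen so far every `interval` tokens and
    yield only the samples that have not been emitted yet.

    `decode` is a hypothetical stand-in for a codec decoder that cannot
    decode chunk-wise and therefore needs the whole sequence each time.
    """
    if interval < 1:
        raise ValueError("interval must be >= 1")

    seen: List[int] = []
    emitted = 0  # number of samples already handed to the player

    for tok in tokens:
        seen.append(tok)
        if len(seen) % interval == 0:
            audio = decode(seen)  # full decode of everything so far
            if len(audio) > emitted:
                yield audio[emitted:]
                emitted = len(audio)

    # Flush whatever is left if the token count is not a multiple of `interval`.
    if seen and len(seen) % interval != 0:
        audio = decode(seen)
        if len(audio) > emitted:
            yield audio[emitted:]


def toy_decode(codes: List[int]) -> List[float]:
    """Toy decoder for the example only: two samples per token."""
    return [float(c) for c in codes for _ in range(2)]


chunks = list(stream_full_redecode(range(5), toy_decode, interval=2))
```

A larger `interval` means fewer full decodes and larger chunks per yield, at the cost of a longer wait before the first chunk.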

We could add some kind of configurable start delay to allow ahead-of-time buffering, or try a simple heuristic that looks at the samples/sec generation speed and buffers accordingly, but it gets complex quickly. I think it might be better handled as a follow-up that enhances the audio player's buffering capabilities.
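As a rough sketch of what such a heuristic could look like (purely illustrative, not something implemented here): assuming a constant generation rate and a known or estimated total audio duration, playback never runs dry if it starts after `duration * (sample_rate / gen_rate - 1)` seconds. The function name and parameters below are hypothetical.

```python
def start_delay_seconds(
    gen_samples_per_sec: float,
    sample_rate: int,
    audio_seconds: float,
) -> float:
    """Estimate how long to buffer before starting playback.

    Assumes generation proceeds at a constant `gen_samples_per_sec` and that
    `audio_seconds` (total output duration) is known or estimated up front.
    Both are simplifying assumptions for this sketch.
    """
    if gen_samples_per_sec <= 0 or sample_rate <= 0:
        raise ValueError("rates must be positive")
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")

    realtime_factor = gen_samples_per_sec / sample_rate
    if realtime_factor >= 1.0:
        # Generation keeps up with playback, so no start delay is needed.
        return 0.0

    # Generation takes audio_seconds / realtime_factor in total; playback takes
    # audio_seconds. Delay the start by the difference so both finish together.
    return audio_seconds * (1.0 / realtime_factor - 1.0)
```

In practice the generation rate varies and the total duration is not known in advance, which is part of why this gets complex quickly.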

Blaizzy

LGTM!

Phenomenal work 🚀

Add streaming support to OuteTTS.

5ceceb3

lucasnewman requested a review from Blaizzy May 19, 2025 23:29

lucasnewman mentioned this pull request May 20, 2025

Audio player output buffering for streaming mode #170

Merged

lucasnewman and others added 3 commits May 22, 2025 10:19

Merge remote-tracking branch 'origin/main' into outetts-streaming

0e63d25

Formatting.

e710484

Merge branch 'main' into outetts-streaming

c8a056b

Blaizzy approved these changes May 24, 2025

Blaizzy merged commit 1eb879e into Blaizzy:main May 24, 2025
2 checks passed

lucasnewman deleted the outetts-streaming branch August 17, 2025 18:53
