[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding #21374

deven-labovitch · 2025-07-22T13:19:12Z

Purpose

Set MAX_AUDIO_CLIP_FILESIZE_MB via env var in SpeechToTextConfig instead of hardcoding to 25

Test Plan

Startup vllm locally with and without VLLM_MAX_AUDIO_CLIP_FILESIZE_MB set. Ensure VLLM_MAX_AUDIO_CLIP_FILESIZE_MB defaults to 25 when unset

Test Result

Passed

gemini-code-assist

Code Review

The pull request successfully makes MAX_AUDIO_CLIP_FILESIZE_MB configurable via an environment variable. However, the current implementation is not robust against invalid values for the environment variable, which could lead to a crash on startup. I've suggested a change to handle this case gracefully.

vllm/entrypoints/openai/speech_to_text.py

github-actions · 2025-07-22T13:50:04Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

DarkLight1337 · 2025-07-22T13:52:47Z

cc @NickLucche

NickLucche

Hey, thanks a lot for contributing!

So the intended usage of SpeechToTextConfig is to hold configs for speech-to-text models.
The way I am thinking about MAX_AUDIO_CLIP_FILESIZE_MB is more from a server managing point of view, purely to limit the amount of bytes streamed and handle traffic/networking rates.

In my view this shouldn't be model-dependent so I would either stick with a configurable env var or a startup argument.
In both cases it would be great if you could add a line to the openai server docs, I am pretty sure I left a TODO.

vllm/envs.py

deven-labovitch · 2025-07-22T14:30:01Z

Hey, thanks a lot for contributing!

So the intended usage of SpeechToTextConfig is to hold configs for speech-to-text models. The way I am thinking about MAX_AUDIO_CLIP_FILESIZE_MB is more from a server managing point of view, purely to limit the amount of bytes streamed and handle traffic/networking rates.

In my view this shouldn't be model-dependent so I would either stick with a configurable env var or a startup argument. In both cases it would be great if you could add a line to the openai server docs, I am pretty sure I left a TODO.

Thanks for taking a look! Makes sense that this isn't per model config, just updated the PR to remove it from `SpeechToTextConfig.

Are these the openai server docs? I can add a line! docs/serving/openai_compatible_server.md

deven-labovitch · 2025-07-22T17:50:48Z

Hey, thanks a lot for contributing!

So the intended usage of SpeechToTextConfig is to hold configs for speech-to-text models. The way I am thinking about MAX_AUDIO_CLIP_FILESIZE_MB is more from a server managing point of view, purely to limit the amount of bytes streamed and handle traffic/networking rates.

In my view this shouldn't be model-dependent so I would either stick with a configurable env var or a startup argument. In both cases it would be great if you could add a line to the openai server docs, I am pretty sure I left a TODO.

@NickLucche Just added in a note to docs/serving/openai_compatible_server.md. Thanks!

Set VLLM_MAX_AUDIO_CLIP_FILESIZE_MB

Signed-off-by: Deven Labovitch <[email protected]>

NickLucche

let's just store the env var value once then we're good here

docs/serving/openai_compatible_server.md

vllm/entrypoints/openai/speech_to_text.py

Signed-off-by: Deven Labovitch <[email protected]>

NickLucche

LGMT, thanks for your work!

docs/serving/openai_compatible_server.md

DarkLight1337

Otherwise LGTM

Signed-off-by: Deven Labovitch <[email protected]>

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: 董巍 <[email protected]>

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: avigny <[email protected]>

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: shuw <[email protected]>

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: x22x22 <[email protected]>

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]>

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: Jinzhen Lin <[email protected]>

deven-labovitch requested a review from aarnphm as a code owner July 22, 2025 13:19

mergify bot added the frontend label Jul 22, 2025

gemini-code-assist bot reviewed Jul 22, 2025

View reviewed changes

vllm/entrypoints/openai/speech_to_text.py Outdated Show resolved Hide resolved

deven-labovitch requested review from simon-mo, WoosukKwon, youkaichao, robertgshaw2-redhat, mgoin, tlrmchlsmth, houseroad and hmellor as code owners July 22, 2025 13:27

deven-labovitch changed the title ~~Set MAX_AUDIO_CLIP_FILESIZE_MB via env var and default to 25~~ Set MAX_AUDIO_CLIP_FILESIZE_MB via SpeechToTextConfig Jul 22, 2025

deven-labovitch changed the title ~~Set MAX_AUDIO_CLIP_FILESIZE_MB via SpeechToTextConfig~~ Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding Jul 22, 2025

NickLucche suggested changes Jul 22, 2025

View reviewed changes

vllm/envs.py Show resolved Hide resolved

mergify bot added the documentation Improvements or additions to documentation label Jul 22, 2025

Signed-off-by: Deven Labovitch <[email protected]>

5c7b9bb

Set VLLM_MAX_AUDIO_CLIP_FILESIZE_MB

deven-labovitch force-pushed the deven/set-MAX_AUDIO_CLIP_FILESIZE_MB-by-env-var branch from 0c54e1b to 5c7b9bb Compare July 22, 2025 19:58

remove extra space

957fc6a

Signed-off-by: Deven Labovitch <[email protected]>

deven-labovitch requested a review from NickLucche July 23, 2025 13:19

NickLucche suggested changes Jul 23, 2025

View reviewed changes

docs/serving/openai_compatible_server.md Show resolved Hide resolved

vllm/entrypoints/openai/speech_to_text.py Outdated Show resolved Hide resolved

PR comment

15844a2

Signed-off-by: Deven Labovitch <[email protected]>

deven-labovitch requested a review from NickLucche July 23, 2025 15:11

NickLucche approved these changes Jul 23, 2025

View reviewed changes

DarkLight1337 reviewed Jul 23, 2025

View reviewed changes

docs/serving/openai_compatible_server.md Outdated Show resolved Hide resolved

DarkLight1337 approved these changes Jul 23, 2025

View reviewed changes

PR comment

d563bcc

Signed-off-by: Deven Labovitch <[email protected]>

deven-labovitch changed the title ~~Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding~~ [Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding Jul 23, 2025

deven-labovitch requested review from DarkLight1337 and NickLucche July 23, 2025 20:07

DarkLight1337 approved these changes Jul 24, 2025

View reviewed changes

vllm-bot merged commit 63d92ab into vllm-project:main Jul 24, 2025
15 checks passed

DW934 pushed a commit to DW934/vllm that referenced this pull request Jul 24, 2025

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hard…

7cf510e

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: 董巍 <[email protected]>

DW934 pushed a commit to DW934/vllm that referenced this pull request Jul 28, 2025

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hard…

1890097

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: 董巍 <[email protected]>

avigny pushed a commit to avigny/vllm that referenced this pull request Jul 31, 2025

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hard…

d770f5d

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: avigny <[email protected]>

wenscarl pushed a commit to wenscarl/vllm that referenced this pull request Aug 4, 2025

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hard…

acbcab3

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: shuw <[email protected]>

x22x22 pushed a commit to x22x22/vllm that referenced this pull request Aug 5, 2025

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hard…

c733571

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]> Signed-off-by: x22x22 <[email protected]>

Pradyun92 pushed a commit to Pradyun92/vllm that referenced this pull request Aug 6, 2025

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hard…

a31ec9b

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]>

npanpaliya pushed a commit to odh-on-pz/vllm-upstream that referenced this pull request Aug 6, 2025

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hard…

22574cd

…coding (vllm-project#21374) Signed-off-by: Deven Labovitch <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding #21374

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding #21374

Uh oh!

deven-labovitch commented Jul 22, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

github-actions bot commented Jul 22, 2025

Uh oh!

DarkLight1337 commented Jul 22, 2025

Uh oh!

NickLucche left a comment

Uh oh!

Uh oh!

deven-labovitch commented Jul 22, 2025 •

edited

Loading

Uh oh!

deven-labovitch commented Jul 22, 2025 •

edited

Loading

Uh oh!

NickLucche left a comment

Uh oh!

Uh oh!

Uh oh!

NickLucche left a comment

Uh oh!

Uh oh!

DarkLight1337 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding #21374

[Frontend] Set MAX_AUDIO_CLIP_FILESIZE_MB via env var instead of hardcoding #21374

Uh oh!

Conversation

deven-labovitch commented Jul 22, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

github-actions bot commented Jul 22, 2025

Uh oh!

DarkLight1337 commented Jul 22, 2025

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

deven-labovitch commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

deven-labovitch commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

deven-labovitch commented Jul 22, 2025 •

edited by github-actions bot

Loading

deven-labovitch commented Jul 22, 2025 •

edited

Loading

deven-labovitch commented Jul 22, 2025 •

edited

Loading