[V0 Deprecation] Remove pooling model support in V0 #23434
Conversation
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Code Review
This pull request effectively removes support for pooling models in the vLLM V0 worker, which aligns with the stated objective. The changes are comprehensive, touching upon the engine, worker, sequence management, and tests to eliminate the V0 pooling code paths. Key modifications include the removal of PoolingModelRunner and V0-specific pooling metadata, updating LLMEngine and AsyncLLMEngine to handle only sampling requests, and stubbing out pooling-related entry points to raise NotImplementedError. The removal of token_type_ids and related logic is also consistent with this goal. The code modifications appear to be correct and consistently applied across the repository. I have not found any issues of high or critical severity.
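To illustrate what "stubbing out pooling-related entry points to raise NotImplementedError" might look like in practice, here is a minimal, hypothetical sketch. The class and method names below are simplified stand-ins, not vLLM's actual implementation: sampling paths stay functional while pooling paths fail fast with a clear message.

```python
class LLMEngine:
    """Illustrative stand-in for a V0 engine after pooling removal.

    Not vLLM's real class; a sketch of the stubbing pattern only.
    """

    def generate(self, prompt: str) -> str:
        # Sampling requests remain supported in the V0 engine.
        return f"generated: {prompt}"

    def encode(self, prompt: str):
        # Pooling entry point stubbed out: it no longer runs any model
        # code and instead directs users to the V1 engine.
        raise NotImplementedError(
            "Pooling models are no longer supported in V0; "
            "please use the V1 engine instead."
        )
```

The advantage of stubbing rather than deleting the method outright is that callers get an actionable error message instead of an AttributeError.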
Not that I'm aware of.
I think it's quite difficult to clean it all up at once. It might take some time to completely remove all the "compatibility code". I hope this PR lands as soon as possible, so we can focus our optimization efforts on a single engine.
I'm running into weird typing issues. Why is
Remove all run_with_both_engines in vllm/tests/models/language/pooling and vllm/tests/entrypoints/. Let's land this PR quickly as long as CI turns green, then polish it in subsequent PRs.
Clean up v0_only in tests/models/registry.py
@DarkLight1337, the basic tests are passing now. Can you enable the full test suite?
Can merge if tests pass
Is the failed test caused by a flaky test?
Yes
Follow-up issue: #23883
Continuation of PR #23302
Summary
@DarkLight1337 , @noooop , do you remember anything else to remove?