
[frontend] spawn engine process from api server process #7484


Merged (6 commits) on Aug 13, 2024

Conversation

youkaichao (Member) commented:

Replaces #7411.
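
For context, the linked issue at the bottom of this page ("RuntimeError: Cannot re-initialize CUDA in forked subprocess") is the classic failure when a process that has already initialized CUDA forks a worker. Below is a minimal sketch of the general technique the title describes, with hypothetical names (`run_engine`, the model string); it is not vLLM's actual implementation:

```python
# Sketch: start the engine in a child process created with the "spawn" start
# method. With the default "fork" method on Linux, a parent that has already
# touched CUDA makes any CUDA call in the forked child raise:
#   RuntimeError: Cannot re-initialize CUDA in forked subprocess.
import multiprocessing as mp


def run_engine(model: str) -> None:
    # Import torch inside the child so CUDA state is created fresh here.
    import torch

    x = torch.zeros(1, device="cuda")  # safe: this process was spawned, not forked
    print(f"engine up for {model}: {x.device}")


if __name__ == "__main__":
    # An explicit "spawn" context avoids relying on the global default,
    # which other libraries may have set to "fork".
    ctx = mp.get_context("spawn")
    engine = ctx.Process(target=run_engine, args=("facebook/opt-125m",))
    engine.start()
    engine.join()
```

Spawning starts a fresh interpreter in the child, so CUDA is initialized there for the first time instead of inheriting the parent's forked, unusable context.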

youkaichao requested a review from njhill on August 13, 2024 at 21:29.

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of the default ones by unblocking the steps in your fast-check build in the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI, as it is required for merging (or just use auto-merge).

To run full CI, you can do one of these:

  • Comment /ready on the PR
  • Add the ready label to the PR
  • Enable auto-merge

🚀

```diff
-    def compute_logits(self, hidden_states: torch.Tensor,
-                       sampling_metadata: SamplingMetadata) -> torch.Tensor:
+    def compute_logits(
+        self,
+        hidden_states: torch.Tensor,
+        sampling_metadata: SamplingMetadata,
+    ) -> Optional[torch.Tensor]:
```
Member commented on the diff above:
Any particular reason for this change? It's Optional in the superclass method that it's overriding...

youkaichao (Member, Author) replied:

This was done automatically by the linter, I think due to a version change of the linter.
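
For reference, a minimal illustration of the typing point under discussion, with hypothetical class names (this is not vLLM's actual class hierarchy):

```python
from typing import Optional

import torch


class BaseModel:
    def compute_logits(self, hidden_states: torch.Tensor) -> Optional[torch.Tensor]:
        # Base contract: implementations are allowed to return None.
        raise NotImplementedError


class MyModel(BaseModel):
    def compute_logits(self, hidden_states: torch.Tensor) -> Optional[torch.Tensor]:
        # Matches the superclass annotation exactly. Annotating the narrower
        # `torch.Tensor` would also type-check, since return types are
        # covariant in overrides, which is why either signature is valid here.
        return hidden_states
```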

njhill (Member) left a review:

Thanks!

youkaichao (Member, Author) commented:

This should also close #7151.

ameza13 commented on Aug 14, 2024:

Hello, when will a release with this fix be available?

youkaichao (Member, Author) replied:

@ameza13 see #7481

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024
LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025

Successfully merging this pull request may close these issues.

[Bug]: ngc24.05 "RuntimeError: Cannot re-initialize CUDA in forked subprocess."