[Benchmark] Add `--async-engine` option to benchmark_throughput.py #7964

njhill · 2024-08-28T19:11:55Z

Uses the `AsyncLLMEngine` interface rather than `LLM`.

The decoupled front-end is used unless `--disable-frontend-multiprocessing` is also specified.
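As a rough illustration of how the two flags interact, here is a minimal sketch using only `argparse`. The flag names come from this PR. The parser layout, the `store_true` actions, and the helper names `build_parser` and `describe_mode` are assumptions made for illustration and are not taken from benchmark_throughput.py.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical, trimmed-down parser: only the two flags named in this PR.
    parser = argparse.ArgumentParser(description="throughput benchmark (sketch)")
    parser.add_argument(
        "--async-engine",
        action="store_true",
        default=False,
        help="Use the async engine interface instead of LLM.",
    )
    parser.add_argument(
        "--disable-frontend-multiprocessing",
        action="store_true",
        default=False,
        help="Do not use the decoupled (multiprocess) front-end.",
    )
    return parser


def describe_mode(args: argparse.Namespace) -> str:
    # Assumed reading of the PR description: the second flag only matters
    # when the async engine is selected.
    if not args.async_engine:
        return "sync LLM"
    if args.disable_frontend_multiprocessing:
        return "AsyncLLMEngine, in-process front-end"
    return "AsyncLLMEngine, decoupled front-end"


if __name__ == "__main__":
    print(describe_mode(build_parser().parse_args()))
```

Under these assumptions, passing `--async-engine` alone selects the decoupled front-end, and adding `--disable-frontend-multiprocessing` switches to the in-process one.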


comaniac

LGTM. Just a nit

comaniac · 2024-09-04T00:36:15Z

benchmarks/benchmark_throughput.py

+        if args.async_engine:
+            run_args.append(args.disable_frontend_multiprocessing)
+            elapsed_time = uvloop.run(run_vllm_async(*run_args))
+        else:
+            elapsed_time = run_vllm(*run_args)


Nit: better to use kwargs

njhill · 2024-09-04

Agree, but the list was already passed as positional args, so this kept the change minimal.
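For reference, the kwargs variant suggested in the nit could look roughly like the sketch below. The two runner functions are stand-in stubs with made-up parameters (`requests`, `model`, `n`) and made-up return values, and `dispatch` is a hypothetical helper. The sketch uses `asyncio.run` from the standard library so it is self-contained, whereas the diff above uses `uvloop.run`.

```python
import asyncio


def run_vllm(requests, model, n):
    # Stand-in stub for the synchronous path; returns a fake elapsed time.
    return 1.0


async def run_vllm_async(requests, model, n,
                         disable_frontend_multiprocessing=False):
    # Stand-in stub for the async path; returns a fake elapsed time that
    # depends on the flag so the dispatch can be observed.
    await asyncio.sleep(0)
    return 2.0 if disable_frontend_multiprocessing else 3.0


def dispatch(async_engine, disable_frontend_multiprocessing, **run_kwargs):
    # Keyword arguments make it explicit which extra option only the
    # async runner receives, instead of appending to a positional list.
    if async_engine:
        return asyncio.run(
            run_vllm_async(
                **run_kwargs,
                disable_frontend_multiprocessing=disable_frontend_multiprocessing,
            )
        )
    return run_vllm(**run_kwargs)


if __name__ == "__main__":
    print(dispatch(True, False, requests=[], model="m", n=1))
```

The trade-off matches the reply above: kwargs read more clearly at the call site, but switching requires touching every place where the existing positional list is built.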

robertgshaw2-redhat reviewed Aug 28, 2024

vllm/entrypoints/openai/api_server.py (outdated, resolved)

robertgshaw2-redhat reviewed Aug 28, 2024

vllm/entrypoints/openai/api_server.py (outdated, resolved)

njhill mentioned this pull request Aug 28, 2024

[Frontend] Minor optimizations to zmq decoupled front-end #7957

Merged

[Benchmark] Add --async-engine option to benchmark_throughput.py

bd5ba81

Uses AsyncLLMEngine interface (rather than LLM). Will use decoupled front-end depending on whether or not --disable-frontend-multiprocessing is also specified.

njhill force-pushed the async-llm-eng-bench branch from a7a6e43 to bd5ba81 Compare September 3, 2024 21:44

njhill changed the title ~~[Benchmark] Add async throughput benchmark~~ [Benchmark] Add --async-engine option to benchmark_throughput.py Sep 3, 2024

vllm-project deleted a comment from github-actions bot Sep 3, 2024

njhill marked this pull request as ready for review September 3, 2024 21:48

njhill changed the title ~~[Benchmark] Add --async-engine option to benchmark_throughput.py~~ [Benchmark] Add --async-engine option to benchmark_throughput.py Sep 3, 2024

Fix typing errors

f58f49f

njhill added the `ready` label (ONLY add when PR is ready to merge/full CI is needed) Sep 3, 2024

comaniac approved these changes Sep 4, 2024

robertgshaw2-redhat merged commit d4db9f5 into vllm-project:main Sep 4, 2024
47 checks passed

njhill deleted the async-llm-eng-bench branch September 4, 2024 01:01

njhill mentioned this pull request Sep 4, 2024

[Core][Bugfix][Perf] Refactor Server to Avoid AsyncLLMEngine #8092

Closed

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Benchmark] Add --async-engine option to benchmark_throughput.py (v…

f80915f

…llm-project#7964) Signed-off-by: Alvant <[email protected]>

LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025

[Benchmark] Add --async-engine option to benchmark_throughput.py (v…

4a39017

…llm-project#7964) Signed-off-by: LeiWang1999 <[email protected]>
