Skip to content

Commit 6409969

Browse files
youkaichaoLeiWang1999
authored andcommitted
[ci][distributed] fix flaky tests (vllm-project#6806)
Signed-off-by: LeiWang1999 <[email protected]>
1 parent e89cb93 commit 6409969

File tree

1 file changed

+8
-1
lines changed

1 file changed

+8
-1
lines changed

tests/distributed/test_pipeline_parallel.py

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,10 @@
1+
"""
2+
WARNING: This test runs in both single-node (4 GPUs) and multi-node
3+
(2 node with 2 GPUs each) modes. If the test only uses 2 GPUs, it is
4+
important to set the distributed backend to "mp" to avoid Ray scheduling
5+
all workers in a node other than the head node, which can cause the test
6+
to fail.
7+
"""
18
import os
29

310
import pytest
@@ -78,7 +85,7 @@ def test_pp_cudagraph(PP_SIZE, MODEL_NAME, ATTN_BACKEND):
7885
"--pipeline-parallel-size",
7986
str(PP_SIZE),
8087
"--distributed-executor-backend",
81-
"ray",
88+
"mp",
8289
]
8390
os.environ["VLLM_ATTENTION_BACKEND"] = ATTN_BACKEND
8491

0 commit comments

Comments
 (0)