Commit 29831a2

hmellor authored and Isotr0py committed
[Doc] Rename offline inference examples (vllm-project#11927)
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
1 parent 6a68a7c commit 29831a2

46 files changed, +46 -46 lines changed

.buildkite/run-cpu-test.sh

Lines changed: 1 addition & 1 deletion
@@ -30,7 +30,7 @@ function cpu_tests() {
  # offline inference
  docker exec cpu-test-"$BUILDKITE_BUILD_NUMBER"-avx2-"$NUMA_NODE" bash -c "
  set -e
- python3 examples/offline_inference/offline_inference.py"
+ python3 examples/offline_inference/basic.py"

  # Run basic model test
  docker exec cpu-test-"$BUILDKITE_BUILD_NUMBER"-"$NUMA_NODE" bash -c "

.buildkite/run-gh200-test.sh

Lines changed: 1 addition & 1 deletion
@@ -24,5 +24,5 @@ remove_docker_container

  # Run the image and test offline inference
  docker run --name gh200-test --gpus=all --entrypoint="" gh200-test bash -c '
- python3 examples/offline_inference/offline_inference.py
+ python3 examples/offline_inference/basic.py
  '

.buildkite/run-hpu-test.sh

Lines changed: 1 addition & 1 deletion
@@ -13,4 +13,4 @@ trap remove_docker_container EXIT
  remove_docker_container

  # Run the image and launch offline inference
- docker run --runtime=habana --name=hpu-test --network=host -e HABANA_VISIBLE_DEVICES=all -e VLLM_SKIP_WARMUP=true --entrypoint="" hpu-test-env python3 examples/offline_inference/offline_inference.py
+ docker run --runtime=habana --name=hpu-test --network=host -e HABANA_VISIBLE_DEVICES=all -e VLLM_SKIP_WARMUP=true --entrypoint="" hpu-test-env python3 examples/offline_inference/basic.py

.buildkite/run-neuron-test.sh

Lines changed: 1 addition & 1 deletion
@@ -51,4 +51,4 @@ docker run --rm -it --device=/dev/neuron0 --device=/dev/neuron1 --network host \
  -e "NEURON_COMPILE_CACHE_URL=${NEURON_COMPILE_CACHE_MOUNT}" \
  --name "${container_name}" \
  ${image_name} \
- /bin/bash -c "python3 /workspace/vllm/examples/offline_inference/offline_inference_neuron.py"
+ /bin/bash -c "python3 /workspace/vllm/examples/offline_inference/neuron.py"

.buildkite/run-openvino-test.sh

Lines changed: 1 addition & 1 deletion
@@ -13,4 +13,4 @@ trap remove_docker_container EXIT
  remove_docker_container

  # Run the image and launch offline inference
- docker run --network host --env VLLM_OPENVINO_KVCACHE_SPACE=1 --name openvino-test openvino-test python3 /workspace/examples/offline_inference/offline_inference.py
+ docker run --network host --env VLLM_OPENVINO_KVCACHE_SPACE=1 --name openvino-test openvino-test python3 /workspace/examples/offline_inference/basic.py

.buildkite/run-tpu-test.sh

Lines changed: 1 addition & 1 deletion
@@ -23,4 +23,4 @@ docker run --privileged --net host --shm-size=16G -it \
  && pytest -v -s /workspace/vllm/tests/tpu/test_custom_dispatcher.py \
  && python3 /workspace/vllm/tests/tpu/test_compilation.py \
  && python3 /workspace/vllm/tests/tpu/test_quantization_accuracy.py \
- && python3 /workspace/vllm/examples/offline_inference/offline_inference_tpu.py"
+ && python3 /workspace/vllm/examples/offline_inference/tpu.py"

.buildkite/run-xpu-test.sh

Lines changed: 2 additions & 2 deletions
@@ -14,6 +14,6 @@ remove_docker_container

  # Run the image and test offline inference/tensor parallel
  docker run --name xpu-test --device /dev/dri -v /dev/dri/by-path:/dev/dri/by-path --entrypoint="" xpu-test sh -c '
- python3 examples/offline_inference/offline_inference.py
- python3 examples/offline_inference/offline_inference_cli.py -tp 2
+ python3 examples/offline_inference/basic.py
+ python3 examples/offline_inference/cli.py -tp 2
  '

.buildkite/test-pipeline.yaml

Lines changed: 10 additions & 10 deletions
@@ -187,19 +187,19 @@ steps:
  - examples/
  commands:
  - pip install tensorizer # for tensorizer test
- - python3 offline_inference/offline_inference.py
+ - python3 offline_inference/basic.py
  - python3 offline_inference/cpu_offload.py
- - python3 offline_inference/offline_inference_chat.py
- - python3 offline_inference/offline_inference_with_prefix.py
+ - python3 offline_inference/chat.py
+ - python3 offline_inference/prefix_caching.py
  - python3 offline_inference/llm_engine_example.py
- - python3 offline_inference/offline_inference_vision_language.py
- - python3 offline_inference/offline_inference_vision_language_multi_image.py
+ - python3 offline_inference/vision_language.py
+ - python3 offline_inference/vision_language_multi_image.py
  - python3 other/tensorize_vllm_model.py --model facebook/opt-125m serialize --serialized-directory /tmp/ --suffix v1 && python3 other/tensorize_vllm_model.py --model facebook/opt-125m deserialize --path-to-tensors /tmp/vllm/facebook/opt-125m/v1/model.tensors
- - python3 offline_inference/offline_inference_encoder_decoder.py
- - python3 offline_inference/offline_inference_classification.py
- - python3 offline_inference/offline_inference_embedding.py
- - python3 offline_inference/offline_inference_scoring.py
- - python3 offline_inference/offline_profile.py --model facebook/opt-125m run_num_steps --num-steps 2
+ - python3 offline_inference/encoder_decoder.py
+ - python3 offline_inference/classification.py
+ - python3 offline_inference/embedding.py
+ - python3 offline_inference/scoring.py
+ - python3 offline_inference/profiling.py --model facebook/opt-125m run_num_steps --num-steps 2

  - label: Prefix Caching Test # 9min
  mirror_hardwares: [amd]
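
The renames visible in the hunks of this commit view can be collected into a lookup table for updating local scripts or CI commands that still call the old paths. The following is a minimal sketch and not part of the commit: `RENAMES`, `PREFIX`, and `update_command` are hypothetical names, and the table lists only the renames visible in this view, which may not cover all 46 changed files.

```python
# Old -> new example filenames under examples/offline_inference/,
# taken only from the hunks visible in this commit view.
RENAMES = {
    "offline_inference.py": "basic.py",
    "offline_inference_cli.py": "cli.py",
    "offline_inference_chat.py": "chat.py",
    "offline_inference_with_prefix.py": "prefix_caching.py",
    "offline_inference_vision_language.py": "vision_language.py",
    "offline_inference_vision_language_multi_image.py": "vision_language_multi_image.py",
    "offline_inference_encoder_decoder.py": "encoder_decoder.py",
    "offline_inference_classification.py": "classification.py",
    "offline_inference_embedding.py": "embedding.py",
    "offline_inference_scoring.py": "scoring.py",
    "offline_inference_neuron.py": "neuron.py",
    "offline_inference_tpu.py": "tpu.py",
    "offline_profile.py": "profiling.py",
}

# Directory component shared by every renamed script (hypothetical constant).
PREFIX = "offline_inference/"


def update_command(command: str) -> str:
    """Rewrite old example paths in a shell command to their new names.

    Each key is matched together with the directory prefix and its ".py"
    suffix, so one old name cannot partially match a longer one.
    """
    for old, new in RENAMES.items():
        command = command.replace(PREFIX + old, PREFIX + new)
    return command


if __name__ == "__main__":
    print(update_command("python3 examples/offline_inference/offline_inference.py"))
```

Commands that reference files not touched by the rename, such as `offline_inference/cpu_offload.py`, pass through unchanged.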

docs/source/contributing/profiling/profiling_index.md

Lines changed: 1 addition & 1 deletion

docs/source/features/structured_outputs.md

Lines changed: 1 addition & 1 deletion
