Skip to content

[BUG]: 503 service unavailable #4798

@pgowda1107

Description

@pgowda1107

deploy-dynamo-vllm-aggregated.yaml

Dynamo-issue.txt

Describe the Bug

We are rying to do aggregated deployment on vllm backend for Llama3.2-3B model (with 1-2 GPUs). We are getting “503 Service Unavailable” with default configuration tried.
Attaching the issue details along with deployment yaml file used for this deployment.
Followed the example of – [https://github.com/ai-dynamo/dynamo/blob/c9d7d95f4be01e6352c51196ed1858ddcb03fce5/examples/backends/vllm/deploy/agg_router.yaml]

Steps to Reproduce

https://github.com/ai-dynamo/dynamo/blob/c9d7d95f4be01e6352c51196ed1858ddcb03fce5/examples/backends/vllm/deploy/agg_router.yaml

Expected Behavior

Run without issuess

Actual Behavior

Server issue

Environment

DGX H100

Additional Context

No response

Screenshots

No response

Metadata

Metadata

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions