-
Notifications
You must be signed in to change notification settings - Fork 729
Open
Labels
bugSomething isn't workingSomething isn't working
Description
deploy-dynamo-vllm-aggregated.yaml
Describe the Bug
We are rying to do aggregated deployment on vllm backend for Llama3.2-3B model (with 1-2 GPUs). We are getting “503 Service Unavailable” with default configuration tried.
Attaching the issue details along with deployment yaml file used for this deployment.
Followed the example of – [https://github.com/ai-dynamo/dynamo/blob/c9d7d95f4be01e6352c51196ed1858ddcb03fce5/examples/backends/vllm/deploy/agg_router.yaml]
Steps to Reproduce
Expected Behavior
Run without issuess
Actual Behavior
Server issue
Environment
DGX H100
Additional Context
No response
Screenshots
No response
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working