About vLLM health checks on k8s

When deploying vLLM on k8s, a health check is configured for service stability. The options are:

1. A TCP check
2. A real LLM request
3. A health check API

1. A TCP check is a little too simple.
2. A real request consumes a relatively large amount of resources.
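To illustrate the first point, here is a minimal sketch of what a TCP-level probe amounts to. The helper name `tcp_check` is hypothetical and not part of vLLM; it only assumes the server listens on some host and port.

```python
import socket

def tcp_check(host, port, timeout_s=1.0):
    """Return True if a TCP connection to (host, port) can be opened."""
    try:
        # Succeeds as soon as the port accepts the connection; nothing is
        # sent or read, so the model itself is never exercised.
        with socket.create_connection((host, port), timeout=timeout_s):
            return True
    except OSError:
        # Connection refused, timed out, host unreachable, etc.
        return False
```

A probe like this would likely keep passing while the process is alive and listening, even if inference itself is stuck, which is why it is a weak signal on its own.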

Can we add a simple calculation to provide a check, as follows?

```
@app.get("/healthz")
async def health_check():
    """Health check."""
    # run a simple computation here
```
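One way the idea might look, as a framework-independent sketch rather than an actual vLLM implementation: run a cheap computation under a timeout and map the outcome to an HTTP status. The names `simple_compute`, the `compute` and `timeout_s` parameters, and the response bodies are all assumptions made for illustration; the real "simple compute" would be whatever the server can run cheaply.

```python
import asyncio

async def simple_compute():
    """Hypothetical stand-in for a cheap computation (assumption, not vLLM code)."""
    return sum(i * i for i in range(1000))

async def health_check(compute=simple_compute, timeout_s=1.0):
    """Return (status_code, body): 200 if the computation finishes in time, else 503."""
    try:
        await asyncio.wait_for(compute(), timeout=timeout_s)
    except Exception:
        # Covers both a timeout and any error raised by the computation.
        return 503, {"status": "unhealthy"}
    return 200, {"status": "ok"}
```

In a web framework, a `/healthz` route handler could call this coroutine and return the status code and body as the HTTP response, which a k8s HTTP probe could then poll. The timeout matters: without it, a hung computation would hang the probe instead of failing it.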

If this approach works, I would like to contribute a PR for it.





about vllm healthy check on k8s #1343
