1 parent ab3a5a8 commit 86fd8bb
vllm/entrypoints/api_server.py
@@ -1,3 +1,9 @@
+"""
+NOTE: This API server is used only for demonstrating usage of AsyncEngine and simple performance benchmarks.
+It is not intended for production use. For production use, we recommend using our OpenAI compatible server.
+We are also not going to accept PRs modifying this file, please change `vllm/entrypoints/openai/api_server.py` instead.
+"""
+
 import argparse
 import json
 from typing import AsyncGenerator