Skip to content

Commit faca3a3

Browse files
authored
fix typo quantization (#2469)
1 parent 7aace7d commit faca3a3

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

fastchat/serve/vllm_worker.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -192,6 +192,7 @@ async def api_model_details(request: Request):
192192
"--controller-address", type=str, default="http://localhost:21001"
193193
)
194194
parser.add_argument("--model-path", type=str, default="lmsys/vicuna-7b-v1.3")
195+
parser.add_argument("--quantization", type=str)
195196
parser.add_argument(
196197
"--model-names",
197198
type=lambda s: s.split(","),

0 commit comments

Comments
 (0)