Support SGLang As Serving Engine #33

zhaochenyang20 · 2024-10-30T00:19:32Z

I added SGLang as an additional serving engine for Qwen models. Accordingly, the installation process now installs vllm and sglang[all] together.

The comparison on my local machine is attached here:

Overall, SGLang matches vllm in accuracy and is much faster, as we expected. Thanks!

zhaochenyang20 added 4 commits October 29, 2024 12:35

add sglang support

04cfa2d

add sglang support

e70be8c

add visual

a32dfcc

add sglang support

2274a21

zhaochenyang20 changed the title ~~Support As Serving Engine~~ Support SGLang As Serving Engine Nov 3, 2024
