Pinned
- AutoSmoothQuant (Public): An easy-to-use package for implementing SmoothQuant for LLMs.
- sgl-project/sglang (Public): SGLang is a fast serving framework for large language models and vision language models.
- volcengine/verl (Public): Volcano Engine Reinforcement Learning for LLMs.
- HandH1998/QQQ (Public): QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.
- vllm (Public, forked from vllm-project/vllm): A high-throughput and memory-efficient inference and serving engine for LLMs.
- torch-int (Public, forked from Guangxuan-Xiao/torch-int): This repository contains integer operators on GPUs for PyTorch.