Skip to content

Conversation

@lanking520
Copy link
Contributor

@lanking520 lanking520 commented Jul 22, 2024

Description

Add new quantization kernel to vLLM/LMI-Dist

Also adding CPU offloading option to GPUs

@lanking520 lanking520 requested review from a team, frankfliu and zachgk as code owners July 22, 2024 21:24
@lanking520 lanking520 changed the title [vLLM] add new kernels to engine [vLLM] add new configs to engine Jul 22, 2024
@lanking520 lanking520 changed the title [vLLM] add new configs to engine [vLLM][0.5.3] add new configs to engine Jul 23, 2024
fix to 0
@lanking520 lanking520 merged commit 6d555b8 into deepjavalibrary:master Jul 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants