[Bugfix] Fix KV head calculation for MPT models when using GQA #5142

Merged
WoosukKwon merged 1 commit into vllm-project:main from bfontain:main on Jun 17, 2024

Commits on May 30, 2024