[Bugfix] Fix KV head calculation for MPT models when using GQA#5142
Merged
WoosukKwon merged 1 commit intovllm-project:mainfrom Jun 17, 2024
Merged
[Bugfix] Fix KV head calculation for MPT models when using GQA#5142WoosukKwon merged 1 commit intovllm-project:mainfrom
WoosukKwon merged 1 commit intovllm-project:mainfrom
Commits
Commits on May 30, 2024
- committed