Skip to content

Pull requests: ROCm/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Model] Add GPT-OSS model code and config
#625 opened Aug 7, 2025 by ashishtanwer Loading…
[FEAT] Support Triton MRoPE kernel
#621 opened Aug 6, 2025 by tjtanaavllm Loading…
4 tasks
add Fused_rms_quant for deepseek_v2 model
#611 opened Jul 29, 2025 by ZJLi2013 Loading…
[FEAT] [ROCm] Shared Experts Aiter
#605 opened Jul 25, 2025 by tjtanaavllm Loading…
add fused fp8 bmm
#604 opened Jul 25, 2025 by k50112113 Loading…
Update fp8 paged attention
#592 opened Jul 9, 2025 by amd-xiaoyu12 Draft
Update test-template.j2
#579 opened Jun 16, 2025 by okakarpa Loading…
Disable skynny gemms by default
#568 opened Jun 5, 2025 by k-artem Loading…
Patch to run AITER 0507 stale
#541 opened May 8, 2025 by qli88 Loading…
Remap fp8 kv-scale names for Deepseek stale
#535 opened May 1, 2025 by sstamenk Loading…
Updated README.md with April 29 results stale
#526 opened Apr 27, 2025 by Mcirino1 Loading…
BF16 Skinny Optimization stale
#520 opened Apr 22, 2025 by amd-hhashemi Loading…
Enable RPD Profiler in OpenAI server stale
#513 opened Apr 15, 2025 by rebklee Loading…
Test Queues
#456 opened Feb 28, 2025 by dhonnappa-amd Draft
Enable custom paged attention kernel for Navi 3/4
#446 opened Feb 24, 2025 by hyoon1 Loading…
ProTip! Exclude everything labeled bug with -label:bug.