forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 48
Pull requests: ROCm/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Perf] Add fused qk rotary embedding kernel for Qwen2.5-VL ViT
#715
opened Sep 26, 2025 by
kliuae-amd
Loading…
5 tasks
[Perf] refactor attention backend for perf boost
#713
opened Sep 26, 2025 by
ganyi1996ppo
Loading…
5 tasks
[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern
#705
opened Sep 24, 2025 by
xytpai
Loading…
[ROCm] Add allreduce dispatcher for ROCm device
#704
opened Sep 24, 2025 by
zejunchen-zejun
Loading…
[ROCm] Add allreduce dispatcher for ROCm device
#695
opened Sep 18, 2025 by
zejunchen-zejun
Loading…
[ROCm] warpSize is being made non constexpr in ROCm 7.0 (#20330)
#694
opened Sep 18, 2025 by
xudonlyu
Loading…
[355_wip] Let inductor capture silu+mul+quant pattern and replace them with aiter operator
#669
opened Sep 11, 2025 by
xytpai
Loading…
support ck-tile fused bias gemm for rocm unquantized gemm
#668
opened Sep 11, 2025 by
eliotwang
Loading…
add fp8 gemm path choice for rocm_aiter_gemm_w8a8_blockscale
#659
opened Sep 8, 2025 by
zhuyuhua-v
Loading…
Updated README.md for August 12 RC2 throughput results only
#631
opened Aug 13, 2025 by
Mcirino1
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.