ROCm / vllm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 48
Star 102

Code
Issues 5
Pull requests 29
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: ROCm/vllm

Labels 14 Milestones 0

New pull request New

29 Open 658 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[Perf] Add fused qk rotary embedding kernel for Qwen2.5-VL ViT

#715 opened Sep 26, 2025 by kliuae-amd

Loading…

5 tasks

[Perf] refactor attention backend for perf boost

#713 opened Sep 26, 2025 by ganyi1996ppo

Loading…

5 tasks

fix hidden pad

#712 opened Sep 26, 2025 by zhiding512

Loading…

add hipblas in Docker build

#708 opened Sep 25, 2025 by dllehr-amd

Loading…

5 tasks

[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern

#705 opened Sep 24, 2025 by xytpai

Loading…

[ROCm] Add allreduce dispatcher for ROCm device

#704 opened Sep 24, 2025 by zejunchen-zejun

Loading…

Qwen-next script

#702 opened Sep 24, 2025 by ZhiweiYan-96

Loading…

5 tasks

[ROCm] Add allreduce dispatcher for ROCm device

#695 opened Sep 18, 2025 by zejunchen-zejun

Loading…

[ROCm] warpSize is being made non constexpr in ROCm 7.0 (#20330)

#694 opened Sep 18, 2025 by xudonlyu

Loading…

Zhimding/355 wip

#675 opened Sep 12, 2025 by coderfeli

Loading…

5 tasks

[355_wip] Let inductor capture silu+mul+quant pattern and replace them with aiter operator

#669 opened Sep 11, 2025 by xytpai

Loading…

support ck-tile fused bias gemm for rocm unquantized gemm

#668 opened Sep 11, 2025 by eliotwang

Loading…

support rocblas for rocm_unquantized_gemm

#665 opened Sep 10, 2025 by eliotwang

Loading…

add fp8 gemm path choice for rocm_aiter_gemm_w8a8_blockscale

#659 opened Sep 8, 2025 by zhuyuhua-v

Loading…

Add cache config for gpt oss

#656 opened Sep 5, 2025 by cagrikymk • Draft

[NOT FOR LANDING] 355_wip_0909_rc2 -> 0909_rc2

#654 opened Sep 4, 2025 by maleksan85 • Draft

fix flashmla metadata build calls()

#636 opened Aug 19, 2025 by ZJLi2013

Loading…

Updated README.md for August 12 RC2 throughput results only

#631 opened Aug 13, 2025 by Mcirino1

Loading…

[Model] Add GPT-OSS model code and config

#625 opened Aug 7, 2025 by ashishtanwer

Loading…

add Fused_rms_quant for deepseek_v2 model

#611 opened Jul 29, 2025 by ZJLi2013

Loading…

add fused fp8 bmm

#604 opened Jul 25, 2025 by k50112113

Loading…

Update fp8 paged attention

#592 opened Jul 9, 2025 by amd-xiaoyu12 • Draft

Update test-template.j2

#579 opened Jun 16, 2025 by okakarpa

Loading…

Disable skynny gemms by default unstale

#568 opened Jun 5, 2025 by k-artem

Loading…

Test Queues

#456 opened Feb 28, 2025 by dhonnappa-amd • Draft

Previous 1 2 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!