Conversation

k50112113 (Contributor) commented:

This PR adds two fused Triton kernels:

  1. fused_gemm_a8w8_blockscale_a16w16
  2. fused_reduce_act_mul_fp8_group_quant

These kernels implement the fused shared-experts path for DeepSeek-V3 (DSV3) on vLLM.

The same changes are also PRed against 355_wip: #1217
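Since the PR text does not show the kernel internals, here is a minimal NumPy sketch of what a kernel named fused_reduce_act_mul_fp8_group_quant plausibly computes. Everything beyond the kernel name is an assumption (SiLU as the activation, FP8 e4m3 dynamic range of ±448, group size 128, a sum over partial outputs as the "reduce" step, and no rounding to the actual FP8 grid); the real Triton kernel would fuse these steps in one pass and emit true FP8 values:

```python
import numpy as np

# Assumptions (not confirmed by the PR): SiLU activation, FP8 e4m3 range
# (max magnitude 448), group size 128, and a sum over partial outputs as
# the "reduce" step. Rounding to the actual FP8 grid is omitted.
FP8_E4M3_MAX = 448.0

def silu(x):
    return x / (1.0 + np.exp(-x))

def reduce_act_mul_fp8_group_quant(gate_partials, up_partials, group_size=128):
    """Sketch: reduce partials, apply SiLU(gate) * up, then per-group
    symmetric quantization into the FP8 e4m3 dynamic range."""
    gate = gate_partials.sum(axis=0)          # assumed reduce step
    up = up_partials.sum(axis=0)
    y = silu(gate) * up                       # fused activation * multiply
    n = y.shape[-1]
    assert n % group_size == 0
    g = y.reshape(*y.shape[:-1], n // group_size, group_size)
    scale = np.abs(g).max(axis=-1, keepdims=True) / FP8_E4M3_MAX
    scale = np.maximum(scale, 1e-12)          # guard all-zero groups
    q = np.clip(g / scale, -FP8_E4M3_MAX, FP8_E4M3_MAX)
    return q.reshape(y.shape), scale.squeeze(-1)

rng = np.random.default_rng(0)
gate_p = rng.standard_normal((2, 4, 256)).astype(np.float32)
up_p = rng.standard_normal((2, 4, 256)).astype(np.float32)
q, s = reduce_act_mul_fp8_group_quant(gate_p, up_p)
# Dequantizing recovers the fused output up to float rounding
# (no FP8 grid snapping in this sketch).
recon = (q.reshape(4, 2, 128) * s[..., None]).reshape(4, 256)
ref = silu(gate_p.sum(0)) * up_p.sum(0)
err = np.abs(recon - ref).max()
```

Fusing these stages into one kernel avoids materializing the intermediate bf16/fp16 activation tensor in global memory, which is the usual motivation for an act-mul-quant fusion in MoE paths.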

Commit: documentation, fix some bugs, UT

k50112113 force-pushed the shaoclee/355_wip_triton_fused_shared_expert branch from ba165a5 to 99fda8d on October 22, 2025 at 22:39.
k50112113 merged commit c0e8d91 into 355_wip_triton on October 22, 2025 (3 of 5 checks passed).
k50112113 deleted the shaoclee/355_wip_triton_fused_shared_expert branch on October 22, 2025 at 22:40.
k50112113 added a commit referencing this pull request on October 23, 2025: documentation, fix some bugs, UT.
