Skip to content

Pull requests: ROCm/aiter

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[TRITON] Fix fp8 bmm op unit test bug on MI350
#1219 opened Oct 17, 2025 by lucas-santos-amd Loading…
1 task
[Triton] [355_wip] DS 355_wip fused_shared_expert
#1217 opened Oct 17, 2025 by k50112113 Loading…
add several f4 tuned config for fw shapes
#1216 opened Oct 17, 2025 by hongxiayang Loading…
Add mha varlen fake for different from mha
#1214 opened Oct 17, 2025 by ZhangLirong-amd Loading…
1 task
[CK_TILE] fmha: Add backward pass support for padded inputs
#1212 opened Oct 17, 2025 by Jeff-Huang Loading…
1 task
update mi308 fmoe fp16 asm
#1201 opened Oct 15, 2025 by amd-ruitang3 Loading…
1 task
Tune gemm op bf16
#1190 opened Oct 14, 2025 by yzhou103 Loading…
1 task
[Triton] triton fp4 gemm preshuffle PR to 355_wip
#1185 opened Oct 13, 2025 by k50112113 Loading…
[CK_TILE] FMHA BWD Optimizations for D48 for GFX950
#1180 opened Oct 13, 2025 by DDEle Loading…
1 task done
[MI35X] Enhance mha bwd varlen kernels
#1179 opened Oct 13, 2025 by slippedJim Loading…
1 task
fix torch compile when using fp8 fla
#1177 opened Oct 13, 2025 by guangzlu Loading…
1 task
add define
#1168 opened Oct 11, 2025 by yixionghuo Loading…
1 task
refactor mha fwd and bwd args
#1165 opened Oct 11, 2025 by minmengdie Loading…
1 task
CI: Operators tuning pipelines
#1163 opened Oct 11, 2025 by gyohuangxin Loading…
A8w8 asm codegen and tune
#1161 opened Oct 11, 2025 by yzhou103 Loading…
1 task
ProTip! Filter pull requests by the default branch with base:main.