Skip to content

Commit a9d02c6

Browse files
LucasWilkinsonDamonFool
authored andcommitted
[BugFix] Illegal Memory Access in the blockwise cutlass fp8 GEMMs (vllm-project#14396)
1 parent d776f78 commit a9d02c6

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

csrc/cutlass_extensions/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -402,7 +402,7 @@ struct CollectiveMma<
402402

403403
// TODO: test `scale_copy_a` with `ScaleMsPerTile` < 128
404404
TiledCopy scale_copy_a = make_tiled_copy(SmemBlockScalingCopyAtomA{},
405-
Layout<Shape<_32, _1>>{}, Layout<Shape<_4, _1>>{}); // (1,1,1)
405+
Layout<Shape<_32>>{}, Layout<Shape<_1>>{}); // (1,1,1)
406406
TiledCopy scale_copy_b = make_tiled_copy(SmemBlockScalingCopyAtomB{},
407407
Layout<Shape<_1>>{}, Layout<Shape<_1>>{}); // (1,1,1)
408408
ThrCopy thr_scale_copy_a = scale_copy_a.get_slice(threadIdx.x);

0 commit comments

Comments
 (0)