Skip to content

Commit 96066f8

Browse files
committed
loading dO and unroll seems to be incompatible with each other.
1 parent 8f2406a commit 96066f8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

lib/nnc/mfa/v2/AttentionDescriptor.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -505,7 +505,7 @@ std::vector<AttentionParameterRow> AttentionDescriptor::forward(MTL::Device *con
505505
std::vector<AttentionParameterRow> AttentionDescriptor::backwardQueryMixed(MTL::Device *const device) const noexcept {
506506
if (device->supportsFamily(MTL::GPUFamily(1009))) {
507507
return {
508-
AttentionParameterRow(80, 16, 64, 8, { AttentionOperand::Q, AttentionOperand::dO, AttentionOperand::dQ }),
508+
AttentionParameterRow(80, 16, 64, 8, { AttentionOperand::Q, AttentionOperand::dQ }),
509509
AttentionParameterRow(192, 16, 64, 32, { AttentionOperand::Q, AttentionOperand::dQ }),
510510
AttentionParameterRow(384, 16, 128, 32, {})
511511
};

0 commit comments

Comments
 (0)