Skip to content
This repository was archived by the owner on Oct 11, 2024. It is now read-only.

Commit 010f2e8

Browse files
tlrmchlsmthRobert Shaw
authored andcommitted
[Bugfix] Fix CUDA version check for mma warning suppression (vllm-project#5642)
1 parent c5ef2f9 commit 010f2e8

File tree

1 file changed

+2
-1
lines changed
  • csrc/quantization/marlin/sparse/common

1 file changed

+2
-1
lines changed

csrc/quantization/marlin/sparse/common/mma.h

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,7 @@
1717

1818
#pragma once
1919
#include "base.h"
20+
#include <cudaTypedefs.h>
2021

2122
namespace marlin_24 {
2223

@@ -26,7 +27,7 @@ namespace marlin_24 {
2627
// | Advisory: Modifier ‘.sp::ordered_metadata’ should be used on instruction
2728
// | ‘mma’ instead of modifier ‘.sp’ as it is expected to have substantially
2829
// | reduced performance on some future architectures
29-
#if defined CUDA_VERSION && CUDA_VERSION >= 12500
30+
#if defined CUDA_VERSION && CUDA_VERSION >= 12050
3031
#define MMA_SP_INST \
3132
"mma.sp::ordered_metadata.sync.aligned.m16n8k32.row.col.f32.f16.f16.f32 "
3233
#else

0 commit comments

Comments
 (0)