Skip to content

Commit 5eae0e5

Browse files
mgoinMu Huai
authored andcommitted
[Bugfix] Disable gptq_bitblas for <SM80 to fix GPTQ on V100/T4 (vllm-project#17541)
Signed-off-by: mgoin <[email protected]> Signed-off-by: Mu Huai <[email protected]>
1 parent 052bb02 commit 5eae0e5

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/layers/quantization/gptq_bitblas.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -134,7 +134,7 @@ def get_supported_act_dtypes(cls) -> List[torch.dtype]:
134134

135135
@classmethod
136136
def get_min_capability(cls) -> int:
137-
return 70
137+
return 80
138138

139139
@classmethod
140140
def get_config_filenames(cls) -> List[str]:

0 commit comments

Comments
 (0)