Skip to content

Commit 022f437

Browse files
committed
w8a8-int8 version
1 parent 5a4a836 commit 022f437

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed
Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
quantization_stage:
2+
quantization_modifiers:
3+
GPTQModifier:
4+
targets: [Linear]
5+
ignore: ['re:.*lm_head', 're:model.visual.*']
6+
weights:
7+
num_bits: 8
8+
type: int

0 commit comments

Comments
 (0)