Skip to content

Commit 84469d6

Browse files
authored
[Inference] Qwen2 support fp8 inference (#8954)
* qwen2 fp8 * fp8 check * fp8 cutlass * int8 cachekv * a8w8c8_fp8
1 parent a275ab7 commit 84469d6

File tree

2 files changed

+451
-55
lines changed

2 files changed

+451
-55
lines changed

0 commit comments

Comments
 (0)