Skip to content

Commit bbc24c7

Browse files
Update measure_ppl2_MC.py
Adding functionality to ingest scaling factors upon merge of the PR vllm-project#3290
1 parent 1e2b203 commit bbc24c7

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

benchmarks/measure_ppl2_MC.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -72,6 +72,8 @@ def vllm_init(args):
7272
kv_cache_dtype=args.kv_cache_dtype,
7373
#scales_path=args.kv_cache_scales_path
7474
# if args.kv_cache_scales_path!='' else None,
75+
quantization-param-path=args.kv_cache_scales_path
76+
if args.kv_cache_scales_path!='' else None,
7577
enforce_eager=args.enforce_eager)
7678

7779
sampling_params = SamplingParams(n=1,

0 commit comments

Comments
 (0)