[E2E Testing] KV-Cache #1004
Conversation
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.
```diff
@@ -0,0 +1,7 @@
+cadence: "nightly"
+test_type: "regression"
+model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
```
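The hunk above shows only the first three lines of the new 7-line config file. For orientation, a complete e2e test config in this style might look like the sketch below; everything beyond the three fields confirmed by the diff (the recipe path and scheme label in particular) is a hypothetical illustration, not the actual file contents.

```yaml
# Hypothetical sketch of a kv-cache e2e test config.
# Only cadence, test_type, and model are confirmed by the diff above;
# the recipe path and scheme label are assumptions for illustration.
cadence: "nightly"
test_type: "regression"
model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
recipe: tests/e2e/vLLM/recipes/kv_cache/recipe_kv_cache.yaml  # assumed path
scheme: FP8_KV  # assumed scheme label
```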
Other models/sizes and recipes are welcome; this currently targets a small model and a fused-layer model.

Can approve if the nightly passes.
Does the nightly pass, @horheynm?
~~Contingent on merge of vllm-project/vllm#11354~~ ^ merged

SUMMARY:
Add kv-cache e2e testing:
* One small model - tinyllama - with kv-cache
* One small model - tinyllama - with kv-cache + gptq
* Fused model - phi3 - with kv-cache

Signed-off-by: Kyle Sayers <[email protected]>
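For readers who want to see what a kv-cache quantization recipe looks like in llm-compressor, the sketch below follows the kv_cache_scheme recipe pattern from the project's quantization_kv_cache examples; the specific values are illustrative assumptions, not necessarily the recipes exercised by these e2e tests.

```yaml
# Minimal kv-cache quantization recipe sketch (values are illustrative
# assumptions, not necessarily those used by this PR's e2e tests).
quant_stage:
  quant_modifiers:
    QuantizationModifier:
      # Quantize the attention KV cache to FP8 with static per-tensor scales.
      kv_cache_scheme:
        num_bits: 8
        type: float
        strategy: tensor
        dynamic: false
        symmetric: true
```

The tinyllama + gptq case listed above would presumably stack a GPTQModifier for weights alongside a kv_cache_scheme like this in the same recipe.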