You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
## Purpose ##
* Fix failing qwen_2_5_vl e2e test
## Changes ##
* For recipes that run smoothquant before gptq, these recipes would
populate `self.rope_deltas`. The existence of `self.rope_deltas` meant
that subsequent tracing calls would not trigger [the condition to inject
tracing
metadata](https://github.com/vllm-project/llm-compressor/blob/main/src/llmcompressor/transformers/tracing/qwen2_5_vl.py#L406),
leading to a tracing failure
## Testing ##
* `CADENCE=weekly
TEST_DATA_FILE=tests/lmeval/configs/vl_int8_w8a8_dynamic_per_token.yaml
python3 -m pytest tests/lmeval/test_lmeval.py -sx`
Signed-off-by: Kyle Sayers <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
0 commit comments