Skip to content

[Bugfix][Tracing] Fix qwen2_5_vl #1448

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
May 19, 2025
Merged

Conversation

kylesayrs
Copy link
Collaborator

Purpose

  • Fix failing qwen_2_5_vl e2e test

Changes

  • For recipes that run smoothquant before gptq, these recipes would populate self.rope_deltas. The existence of self.rope_deltas meant that subsequent tracing calls would not trigger the condition to inject tracing metadata, leading to a tracing failure

Testing

  • CADENCE=weekly TEST_DATA_FILE=tests/lmeval/configs/vl_int8_w8a8_dynamic_per_token.yaml python3 -m pytest tests/lmeval/test_lmeval.py -sx

Copy link

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

@kylesayrs kylesayrs added the ready When a PR is ready for review label May 19, 2025
@dsikka dsikka enabled auto-merge (squash) May 19, 2025 16:03
@dsikka dsikka merged commit 5edf565 into main May 19, 2025
11 checks passed
@dsikka dsikka deleted the kylesayrs/fix-qwen2_5_vl_tracing branch May 19, 2025 17:25
aireilly pushed a commit to aireilly/llm-compressor that referenced this pull request Jul 30, 2025
## Purpose ##
* Fix failing qwen_2_5_vl e2e test

## Changes ##
* For recipes that run smoothquant before gptq, these recipes would
populate `self.rope_deltas`. The existence of `self.rope_deltas` meant
that subsequent tracing calls would not trigger [the condition to inject
tracing
metadata](https://github.com/vllm-project/llm-compressor/blob/main/src/llmcompressor/transformers/tracing/qwen2_5_vl.py#L406),
leading to a tracing failure

## Testing ##
* `CADENCE=weekly
TEST_DATA_FILE=tests/lmeval/configs/vl_int8_w8a8_dynamic_per_token.yaml
python3 -m pytest tests/lmeval/test_lmeval.py -sx`

Signed-off-by: Kyle Sayers <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ready When a PR is ready for review
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants