Skip to content

[Example] [Bugfix] Fix Gemma3 Generation #1517

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jun 6, 2025

Conversation

kylesayrs
Copy link
Collaborator

Purpose

  • Fix Gemma3 example

Background

Changes

  • Disable compilation when generating with Gemma3 in example

Signed-off-by: Kyle Sayers <[email protected]>
Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

Copy link

github-actions bot commented Jun 6, 2025

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

Copy link
Collaborator

@rahul-tuli rahul-tuli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good catch!

@kylesayrs kylesayrs added the ready When a PR is ready for review label Jun 6, 2025
@kylesayrs kylesayrs enabled auto-merge (squash) June 6, 2025 16:41
@kylesayrs kylesayrs merged commit 341e27c into main Jun 6, 2025
12 checks passed
@kylesayrs kylesayrs deleted the kylesayrs/patch-gemma3-example branch June 6, 2025 16:41
dsikka pushed a commit that referenced this pull request Jun 16, 2025
## Purpose ##
* Fix gemma2 generation
* See #1517

Signed-off-by: Kyle Sayers <[email protected]>
brian-dellabetta pushed a commit that referenced this pull request Jun 30, 2025
## Purpose ##
* Fix generation for gemma models
* See: #1517

## Changes ##
* Disable compilation for remaining gemma models

---------

Signed-off-by: Kyle Sayers <[email protected]>
kylesayrs added a commit that referenced this pull request Jun 30, 2025
## Purpose ##
* Fix generation for gemma models
* See: #1517

## Changes ##
* Disable compilation for remaining gemma models

---------

Signed-off-by: Kyle Sayers <[email protected]>
aireilly pushed a commit to aireilly/llm-compressor that referenced this pull request Jul 30, 2025
## Purpose ##
* Fix Gemma3 example

## Background ##
* Transformers `torch.compile`s the model function before running
generation. However, some models fail to compile properly. As of the
latest transformers version, Gemma3 is one of those models
huggingface/transformers#38333

## Changes ##
* Disable compilation when generating with Gemma3 in example

Signed-off-by: Kyle Sayers <[email protected]>
aireilly pushed a commit to aireilly/llm-compressor that referenced this pull request Jul 30, 2025
## Purpose ##
* Fix gemma2 generation
* See vllm-project#1517

Signed-off-by: Kyle Sayers <[email protected]>
aireilly pushed a commit to aireilly/llm-compressor that referenced this pull request Jul 30, 2025
## Purpose ##
* Fix generation for gemma models
* See: vllm-project#1517

## Changes ##
* Disable compilation for remaining gemma models

---------

Signed-off-by: Kyle Sayers <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ready When a PR is ready for review
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants