-
-
Notifications
You must be signed in to change notification settings - Fork 10.4k
[Bugfix][v1] fixed llava-hf/llava-1.5-7b-hf is broken on V1 #14554
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀 |
Signed-off-by: chaunceyjiang <[email protected]>
Test # VLLM_USE_V1=1 python3 examples/offline_inference/vision_language.py -m llava
...
...
INFO 03-10 11:29:52 [gpu_model_runner.py:1416] Graph capturing finished in 20 secs, took 1.42 GiB
INFO 03-10 11:29:52 [core.py:120] init engine (profile, create kv cache, warmup model) took 42.62 seconds
Using a slow image processor as `use_fast` is unset and a slow processor was saved with this model. `use_fast=True` will be the default behavior in v4.48, even if the model was saved with a slow processor. This will result in minor differences in outputs. You'll still be able to use a slow processor with `use_fast=False`.
Processed prompts: 100%|████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 4.44it/s, est. speed input: 2646.97 toks/s, output: 284.24 toks/s]
The image features a tall tower with a spire, surrounded by a beautiful flowering tree. The tree is filled with pink flowers, creating a picturesque scene. The tower stands tall in the background, with the tree's branches extending towards it. The combination of the tower and the flowering tree creates a
The image features a tall tower with a spire, surrounded by a forest of pink flowers. The tower stands tall amidst the vibrant blossoms, creating a picturesque scene. The tower's height and the abundance of flowers create a sense of grandeur and beauty in the landscape.
The image features a tall tower with a spire, surrounded by a beautiful cherry blossom tree. The tree is filled with pink flowers, creating a stunning contrast against the tower's structure. The tower stands tall in the background, with the tree's branches extending towards it. The scene capt
The image features a tall building with a spire, surrounded by a beautiful flowering tree filled with pink flowers. The tree is located in front of the building, creating a striking contrast between the architectural structure and the natural beauty of the flowers. The scene captures the essence of harmony between urban architecture |
@DarkLight1337 @lk-chen PTAL. |
|
Signed-off-by: DarkLight1337 <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for fixing!
Thanks~ |
Thanks for the fix! |
Thanks! |
…ject#14554) Signed-off-by: chaunceyjiang <[email protected]> Signed-off-by: DarkLight1337 <[email protected]> Co-authored-by: DarkLight1337 <[email protected]> Signed-off-by: Louis Ulmer <[email protected]>
…ject#14554) Signed-off-by: chaunceyjiang <[email protected]> Signed-off-by: DarkLight1337 <[email protected]> Co-authored-by: DarkLight1337 <[email protected]>
FIX #14523
vllm/vllm/model_executor/models/llava.py
Lines 386 to 395 in 5d80252
The root cause of the issue is that PR #14275 introduced additional parameters
feat_is_patch
andembed_is_patch
forPixtralHFMultiModalProcessor
, but these two parameters are not present in theLlavaMultiModalProcessor
.vllm/vllm/model_executor/models/llava.py
Lines 298 to 310 in 5d80252