[Feature]: Support image embeddings as input for qwen2vl

### 🚀 The feature, motivation and pitch


Most multimodal models support input image embeddings. see previous pr: https://github.com/vllm-project/vllm/pull/6613
IMO there's no reason not to support qwen2vl.

When I was about to add this feature to qwen2vl. Unfortunately, I've run into some difficulties.
For example, I can't just rely on image embedding to generate new prompt_token_ids without the original image. See [here](https://github.com/vllm-project/vllm/blob/4bb98f2190aaf408cb063df5184829fb54ee5f81/vllm/model_executor/models/qwen2_vl.py#L788.)

    height, width = get_image_size(image, channel_dim=input_data_format)
And [here](https://github.com/vllm-project/vllm/blob/4bb98f2190aaf408cb063df5184829fb54ee5f81/vllm/model_executor/models/qwen2_vl.py#L564C5-L564C33), if we just return image embeds, it will occur an error. AssertionError: mrope embedding type requires multi-modal input mapper returns 'image_grid_thw' or 'video_grid_thw'.

Might we need to passthrough more parameters for qwen2vl? please me give some tips.
here is my draft code: https://github.com/vllm-project/vllm/pull/8856

### Alternatives

_No response_

### Additional context

_No response_

### Before submitting a new issue...

- [X] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Feature]: Support image embeddings as input for qwen2vl #8857

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Feature]: Support image embeddings as input for qwen2vl #8857

Description

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions