[feat/multimodal] support for Qwen2-VL

### 🐛 Describe the bug

Example:

```python
modeling_qwen2_vl.apply_rotary_pos_emb = liger_rotary_pos_emb
modeling_qwen2_vl.Qwen2RMSNorm = LigerRMSNorm
modeling_qwen2_vl.CrossEntropyLoss = LigerCrossEntropyLoss
modeling_qwen2_vl.Qwen2VLForConditionalGeneration.forward = qwen2_vl_lce_forward
modeling_qwen2_vl.Qwen2MLP = LigerSwiGLUMLP
```

The `qwen2_vl_lce_forward` needs to be implemented to adapt to the image inputs

### Reproduce

Na

### Versions

Environment Report:
-------------------
Python version: 3.10.13
PyTorch version: 2.2.0+cu118
CUDA version: 11.8
Triton: Not installed
Transformers version: 4.45.0.dev0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[feat/multimodal] support for Qwen2-VL #165

🐛 Describe the bug

Reproduce

Versions

Environment Report:

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[feat/multimodal] support for Qwen2-VL #165

Description

🐛 Describe the bug

Reproduce

Versions

Environment Report:

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions