[feat/multimodal] support for Qwen2-VL #165

@hiyouga

🐛 Describe the bug

Example:

```python
modeling_qwen2_vl.apply_rotary_pos_emb = liger_rotary_pos_emb
modeling_qwen2_vl.Qwen2RMSNorm = LigerRMSNorm
modeling_qwen2_vl.CrossEntropyLoss = LigerCrossEntropyLoss
modeling_qwen2_vl.Qwen2VLForConditionalGeneration.forward = qwen2_vl_lce_forward
modeling_qwen2_vl.Qwen2MLP = LigerSwiGLUMLP
```

`qwen2_vl_lce_forward` still needs to be implemented so that the patched forward pass handles image inputs.
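For anyone picking this up, the patching pattern in the example could be sketched roughly as below. This is only an illustration under stated assumptions: `apply_patches` is a hypothetical helper, and `fake_modeling`, `FakeLigerRMSNorm`, and `FakeLigerSwiGLUMLP` are stand-ins so the snippet runs without `transformers` or Liger installed. It does not implement `qwen2_vl_lce_forward`.

```python
import types


def apply_patches(module, patches):
    """Swap attributes on `module` and return the originals for rollback.

    Hypothetical helper, not part of transformers or Liger.
    """
    # Validate every name first so a typo cannot leave the module half-patched.
    missing = [name for name in patches if not hasattr(module, name)]
    if missing:
        raise AttributeError(f"{module.__name__} has no attribute(s): {missing}")

    originals = {}
    for name, replacement in patches.items():
        originals[name] = getattr(module, name)
        setattr(module, name, replacement)
    return originals


# Stand-in for transformers' modeling_qwen2_vl (assumption, for illustration).
fake_modeling = types.ModuleType("fake_modeling_qwen2_vl")


class Qwen2RMSNorm:
    pass


class Qwen2MLP:
    pass


fake_modeling.Qwen2RMSNorm = Qwen2RMSNorm
fake_modeling.Qwen2MLP = Qwen2MLP


# Stand-ins for the Liger replacements (assumption, for illustration).
class FakeLigerRMSNorm:
    pass


class FakeLigerSwiGLUMLP:
    pass


originals = apply_patches(
    fake_modeling,
    {"Qwen2RMSNorm": FakeLigerRMSNorm, "Qwen2MLP": FakeLigerSwiGLUMLP},
)
```

Class replacements made this way only affect models constructed after the patch, so the patch would need to run before the model is instantiated. Keeping `originals` makes it possible to restore the module afterwards.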

Reproduce

N/A

Versions

Environment Report:

Python version: 3.10.13
PyTorch version: 2.2.0+cu118
CUDA version: 11.8
Triton: Not installed
Transformers version: 4.45.0.dev0
