-
Notifications
You must be signed in to change notification settings - Fork 285
Enable QWen VL video preprocess #2514
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
xipingyan
wants to merge
58
commits into
openvinotoolkit:master
Choose a base branch
from
xipingyan:xp/enable_qwen_vl_video_preprocess
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+1,038
−221
Open
Changes from 8 commits
Commits
Show all changes
58 commits
Select commit
Hold shift + click to select a range
6c49dc8
Avoid to do resize for same width and height images.
xipingyan c7d9932
Enable video process for qwen*-vl
xipingyan 2ee043f
Add python interface: generate config: is_video, default false.
xipingyan 29c74fd
fallback video_encode to image encode in base class.
xipingyan 78dac29
Update calc target image size.
xipingyan 7b2c115
Reduce shared codes, fallback to image process via return empty vector;
xipingyan 10d8e8d
1: remove is_video,
xipingyan a3000d4
Update src/cpp/src/visual_language/llava/classes.cpp
xipingyan 062fc40
Merge branch 'master' into xp/enable_qwen_vl_video_preprocess
xipingyan 4d8375d
Update src/cpp/src/visual_language/pipeline.cpp
xipingyan ef9f868
rename according to copilot suggestion
xipingyan ad95828
Merge branch 'xp/enable_qwen_vl_video_preprocess' of https://github.c…
xipingyan f92b19b
rename rgbs to images
xipingyan 66cdf38
enable if node to unify image and video preprocess.
xipingyan 3eda036
cpp preprocess: enable video preprecess.
xipingyan 3df267f
Pass same_images
xipingyan bf3169b
add commments for same image
xipingyan e1250aa
Update loop condition, and rename variables.
xipingyan fe0ab92
Update src/cpp/src/visual_language/pipeline_base.hpp
xipingyan dec67b2
video should be frames.
xipingyan caee3fd
Add pytest for video input.
xipingyan 6a49a48
Merge branch 'master' into xp/enable_qwen_vl_video_preprocess
xipingyan 800638e
Merge branch 'master' into xp/enable_qwen_vl_video_preprocess
peterchen-intel 1502b28
Remove is_video python attribute.
xipingyan 4d8e867
rename video to videos
xipingyan ea7fc94
Update docs, and add video for add_request.
xipingyan 60364bf
Fix docs format.
xipingyan 4ea5b3d
Fix test error: can't catch exception.
xipingyan 8a0ab2e
Fix: cannot be narrowed from type 'int' to 'float' in initializer list
xipingyan 28337ea
Support no image or video input;
xipingyan f3fd7d4
Add checking input for python api.
xipingyan a80d28e
cpp interface: generate, remove video. add is_video, default false
xipingyan 6ab0a35
update get_inputs_embeds_with_token_type_ids and get_inputs_embeds, i…
xipingyan c531982
Merge branch 'master' into xp/enable_qwen_vl_video_preprocess
xipingyan dc30ec1
update pyi interface of generate.
xipingyan 5edf0a5
Remove "const bool& is_video" in add_request and generate.
xipingyan 2215f8a
Update src/cpp/src/visual_language/qwen2vl/classes.cpp
xipingyan 14352a7
Update src/python/openvino_genai/py_openvino_genai.pyi
xipingyan 89afa54
copilot give a wrong suggestion. add images and video param for add_r…
xipingyan 3b5c6cd
Merge remote-tracking branch 'origin/master' into xp/enable_qwen_vl_v…
xipingyan 8768795
Add examples to .md
xipingyan be57bf2
Fix test video error, and input multiple images.
xipingyan d96c5dd
Update test based on 4D video.
xipingyan aaf20b0
Add vlm test dependency: opencv-python
xipingyan a2ad61b
Merge remote-tracking branch 'origin/master' into xp/enable_qwen_vl_v…
xipingyan 6f5189b
Enable mix video and image input.
xipingyan c0829a3
split encode_images into encode_images and encode_video
xipingyan f25770b
Remove:
xipingyan 72c621b
1: Add <video_pad> placeholder,
xipingyan 132b228
Update position_ids after enable video.
xipingyan 8c0e13d
add video histry id.
xipingyan 64ba684
Update src/cpp/include/openvino/genai/visual_language/pipeline.hpp
xipingyan bbbef65
Merge branch 'xp/enable_qwen_vl_video_preprocess' of https://github.c…
xipingyan 6e33dcf
Rename video to videos, reducing confusion.
xipingyan 6bf63de
Remove useless header.
xipingyan eb4faea
Update video-> videos in Readme
xipingyan 123221b
all video -> videos
xipingyan 515c911
Call images when the models not implement video process.
xipingyan File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.