-
-
Notifications
You must be signed in to change notification settings - Fork 9.2k
[Model] Refactoring of MiniCPM-V and add MiniCPM-o-2.6 support for vLLM #12069
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
DarkLight1337
merged 118 commits into
vllm-project:main
from
HwwwwwwwH:minicpmv-refactor
Jan 29, 2025
Merged
Changes from 115 commits
Commits
Show all changes
118 commits
Select commit
Hold shift + click to select a range
f78ad12
refactor for images
HwwwwwwwH 95230b9
supprot image embedding for minicpmv
HwwwwwwwH 42ffb1b
[Bugfix][SpecDecode] Adjust Eagle model architecture to align with in…
llsj14 43ff2e9
[Bugfix] fused_experts_impl wrong compute type for float32 (#11921)
shaochangxu 0ec9974
[CI/Build] Move model-specific multi-modal processing tests (#11934)
DarkLight1337 b4a9094
[Doc] Basic guide for writing unit tests for new models (#11951)
DarkLight1337 ac29198
[Bugfix] Fix RobertaModel loading (#11940)
NickLucche 286107f
[Model] Add cogagent model support vLLM (#11742)
sixsixcoder 535e120
[V1] Avoid sending text prompt to core engine (#11963)
ywang96 925562b
[CI/Build] Add markdown linter (#11857)
rafvasq 936b306
[Model] Initialize support for Deepseek-VL2 models (#11578)
Isotr0py 141151f
[Hardware][CPU] Multi-LoRA implementation for the CPU backend (#11100)
Akshat-Tripathi eac7811
[Hardware][TPU] workaround fix for MoE on TPU (#11764)
avshalomman e251866
[V1][Core][1/n] Logging and Metrics (#11962)
robertgshaw2-redhat e46c06b
[Model] Support GGUF models newly added in `transformers` 4.46.0 (#9685)
Isotr0py d12c0de
[V1] [2/n] Logging and Metrics - `OutputProcessor` Abstraction (#11973)
robertgshaw2-redhat e459c90
[MISC] fix typo in kv transfer send recv test (#11983)
yyccli 93a78ba
[Bug] Fix usage of `.transpose()` and `.view()` consecutively. (#11979)
liaoyanqing666 dd2f627
[CI][Spec Decode] fix: broken test for EAGLE model (#11972)
llsj14 570e067
[Misc] Fix Deepseek V2 fp8 kv-scale remapping (#11947)
Concurrensee eaccb74
[Misc]Minor Changes about Worker (#11555)
noemotiovon 7adb4a0
[platform] add ray_device_key (#11948)
youkaichao a014ddd
Fix Max Token ID for Qwen-VL-Chat (#11980)
alex-jw-brooks cedf6cc
[Kernel] unified_attention for Attention.forward (#11967)
heheda12345 a1f053f
[Doc][V1] Update model implementation guide for V1 support (#11998)
ywang96 651ee49
[Doc] Organise installation documentation into categories and tabs (#…
hmellor adc0b54
[platform] add device_control env var (#12009)
youkaichao 1fa0b25
[Platform] Move get_punica_wrapper() function to Platform (#11516)
shen-shanshan e55869e
bugfix: Fix signature mismatch in benchmark's `get_tokenizer` functio…
e1ijah1 7f2aa68
[Doc] Fix build from source and installation link in README.md (#12013)
Yikun a1f0814
[Bugfix] Fix deepseekv3 gate bias error (#12002)
SunflowerAries 0ca468e
[Docs] Add Sky Computing Lab to project intro (#12019)
WoosukKwon 6bec0d0
[HPU][Bugfix] set_forward_context and CI test execution (#12014)
kzawora-intel 0badf14
[Doc] Update Quantization Hardware Support Documentation (#12025)
tjtanaa c6a5060
[HPU][misc] add comments for explanation (#12034)
youkaichao 055a2b7
[Bugfix] Fix various bugs in multi-modal processor (#12031)
DarkLight1337 941a5d5
[Kernel] Revert the API change of Attention.forward (#12038)
heheda12345 3a05c49
[Platform] Add output for Attention Backend (#11981)
wangxiyuan 87a687b
[Bugfix][Kernel] Give unique name to BlockSparseFlashAttention (#12040)
heheda12345 3183e6a
Explain where the engine args go when using Docker (#12041)
hmellor cc9cde5
[Doc]: Update the Json Example of the `Engine Arguments` document (#1…
maang-h 58d45cd
[Misc] Merge bitsandbytes_stacked_params_mapping and packed_modules_…
jeejeelee bb13b8a
[Kernel] Support MulAndSilu (#11624)
jeejeelee 1bba3f6
[HPU][Bugfix] Don't use /dev/accel/accel0 for HPU autodetection in se…
kzawora-intel ef22c6c
[Platform] move current_memory_usage() into platform (#11369)
shen-shanshan 94adbff
[V1][BugFix] Fix edge case in VLM scheduling (#12065)
WoosukKwon 654f5d7
[Misc] Add multipstep chunked-prefill support for FlashInfer (#10467)
elfiegg 8146c68
[core] Turn off GPU communication overlap for Ray executor (#12051)
ruisearch42 59e5cf4
[core] platform agnostic executor via collective_rpc (#11256)
youkaichao 920038b
merge main
HwwwwwwwH 6f6d2eb
video embedding supports
HwwwwwwwH 364bca1
update support for minicpmo on images and videos
HwwwwwwwH c2d8dbb
audio language
HwwwwwwwH 1ba77eb
audio embedding inputs
HwwwwwwwH 1c6f7d8
format
HwwwwwwwH 26d40a5
merge main x
HwwwwwwwH 24d9a80
merge main
HwwwwwwwH 29774db
Merge branch 'main' of https://github.com/vllm-project/vllm into mini…
jeejeelee 6c409c5
docs/server-chat-utils/tests for minicpmo
HwwwwwwwH ee2f7da
Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
HwwwwwwwH 42e7e78
Update docs/source/models/supported_models.md
HwwwwwwwH 6c0a686
Update tests/models/decoder_only/vision_language/test_models.py
HwwwwwwwH c15228b
format
HwwwwwwwH c51026d
Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
HwwwwwwwH ac26f59
split minicpmo in a separate file
HwwwwwwwH 8b0cbf7
format
HwwwwwwwH 428ae5a
Update vllm/model_executor/models/minicpmo.py
HwwwwwwwH edfac98
add hints
HwwwwwwwH 4ed8b11
format
HwwwwwwwH b44085e
clean unnecessary logic of WhisperEncoder
HwwwwwwwH 763c578
format
HwwwwwwwH cd68484
Update vllm/model_executor/models/minicpmo.py
HwwwwwwwH 1e47208
add torchaudio for test
HwwwwwwwH 781d1c3
add annotations
HwwwwwwwH f0b0270
format
HwwwwwwwH ed1dd9e
Merge remote-tracking branch 'upstream/main' into minicpmv-refactor
ywang96 6d5978a
enable MiniCPMV-MiniCPMO for cache
HwwwwwwwH 3bb67f8
Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
HwwwwwwwH 25d86ce
add multimodal tests for minicpmv
HwwwwwwwH bec9a73
format
HwwwwwwwH 2120dd6
custom_hf_runner for minicpmo
HwwwwwwwH 0fd4347
Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
HwwwwwwwH 6d2f4e4
format
HwwwwwwwH fac61eb
pass all tests
HwwwwwwwH 6037606
format / pass all tests
HwwwwwwwH b6f24f7
fix num_slices bug
HwwwwwwwH e439d3a
Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
HwwwwwwwH 0f67ac9
add examples
HwwwwwwwH eab479f
add examples and format tests
HwwwwwwwH 05a0ef8
format
HwwwwwwwH 6650450
Update tests/models/decoder_only/vision_language/vlm_utils/model_util…
HwwwwwwwH 8f5b069
Update vllm/model_executor/models/minicpmv.py
HwwwwwwwH de0b55f
Update vllm/model_executor/models/minicpmv.py
HwwwwwwwH ad52859
Update vllm/model_executor/models/minicpmv.py
HwwwwwwwH c5b912d
Update vllm/model_executor/models/minicpmv.py
HwwwwwwwH 00e9e5a
Update vllm/model_executor/models/minicpmo.py
HwwwwwwwH 49ea11e
alphabet
HwwwwwwwH 595c679
add annotations
HwwwwwwwH 061596f
Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
HwwwwwwwH 26d4b2b
add torchaudio dependency
HwwwwwwwH 5867171
format
HwwwwwwwH bed7843
torchaudio
HwwwwwwwH 715bd9f
fix minicpmo_patch_hf_runner
HwwwwwwwH cf4788f
fix slice bug
HwwwwwwwH 53c679e
Merge branch 'main' into minicpmv-refactor
HwwwwwwwH 3127a6b
format
HwwwwwwwH 290795b
test model register
HwwwwwwwH d9dedd7
delete minicpmv2.5 in test_common
HwwwwwwwH f6d5cfa
add dependencies of minicpmo audio tests
HwwwwwwwH da2ddd3
format
HwwwwwwwH 4222899
add vocos in requirements_test.in
HwwwwwwwH 26ebc7c
Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
HwwwwwwwH 2e93896
alphabet in example file and server
HwwwwwwwH 0dfa513
Merge branch 'main' into minicpmv-refactor
HwwwwwwwH dadd030
Merge branch 'main' into minicpmv-refactor
DarkLight1337 f5a188a
merge main && fix conflict
HwwwwwwwH 8216fd5
delete vocos in setup.py
HwwwwwwwH 4cfd785
update docs
HwwwwwwwH File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.