Pull requests: vllm-project/vllm-ascend
Pull requests list
[Bugfix][CI] Update ModelRunnerOutput Params to Sync Upstream Changes
#2189 opened Aug 4, 2025 by shen-shanshan
SwiftBalancer Zero OverHead Expert Movement
module:core
module:ops
module:quantization
#2186 opened Aug 4, 2025 by raindaywhu
[BugFix] Fix the bug that qwen3 moe doesn't work with EP
accuracy-test
enable all accuracy test for PR
module:ops
ready-for-test
start test by label for PR
#2183 opened Aug 4, 2025 by wangxiyuan
[Bugfix] Fix broken CI
accuracy-test
enable all accuracy test for PR
module:tests
ready-for-test
start test by label for PR
vllm-break
#2181 opened Aug 3, 2025 by Potabk
[0.9.1]remove chunked_prefill_for_mla
documentation
Improvements or additions to documentation
module:core
module:ops
#2177 opened Aug 2, 2025 by fems14
[Bugfix][PD] Auto-clear producer KV cache if no pull notification
module:core
#2174 opened Aug 2, 2025 by underfituu
[main][Feature] Support deepseek w4a8 quantization
module:quantization
module:tests
#2172 opened Aug 1, 2025 by kunpengW-code
[feat]: oproj tensor parallelism in pure DP and graph-mode scenarios.
module:core
module:ops
module:quantization
#2167 opened Aug 1, 2025 by lidenghui1110
[Doc][PD] Restore the default configuration items in examples/disaggregate_prefill_v1/README.md
documentation
Improvements or additions to documentation
#2165 opened Aug 1, 2025 by underfituu
Fix accuracy test config --config-list-file
accuracy-test
enable all accuracy test for PR
module:tests
ready-for-test
start test by label for PR
#2163 opened Aug 1, 2025 by wxsIcey
[Misc] Support kimi-k2-w8a8
documentation
Improvements or additions to documentation
module:core
module:quantization
#2162 opened Aug 1, 2025 by Potabk
[main][Bugfix] Fix unable to load qwen3_moe quantized weights
module:tests
#2161 opened Aug 1, 2025 by zhoux77899
add super kernel for decode moe
module:core
module:ops
module:quantization
#2157 opened Aug 1, 2025 by NNUCJ
Remove cat
ci/build
documentation
Improvements or additions to documentation
module:core
module:ops
module:quantization
module:tests
module:tools
#2153 opened Jul 31, 2025 by loukong33
ut: add example and e2e test for sleepmode in external_launcher
module:tests
#2152 opened Jul 31, 2025 by loukong33
refactor fused_moe.py
merge-conflicts
module:ops
module:quantization
#2150 opened Jul 31, 2025 by shiyuan680