-
-
Notifications
You must be signed in to change notification settings - Fork 11.2k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[bugfix] correct local_chunk_len for DCP in reorg_kvcache with long context
v1
#28526
opened Nov 12, 2025 by
pisceskkk
Loading…
[Bugfix] Missing cached item in the MultiModalReceiverCache
multi-modality
Related to multi-modality (#4194)
#28525
opened Nov 12, 2025 by
knlnguyen1802
Loading…
5 tasks
Fix Llama4 Pipeline Parallelism
llama
Related to Llama models
#28524
opened Nov 12, 2025 by
River12
Loading…
debug tests/tool_use/test_tool_calls.py failure on AMD
ci/build
documentation
Improvements or additions to documentation
frontend
performance
Performance-related issues
rocm
Related to AMD ROCm
tool-calling
[XPU]Fix crash due to removed VLLM_USE_V1 attribute
#28520
opened Nov 12, 2025 by
chaojun-zhang
Loading…
5 tasks
[bugfix] [torch.compile] startup time logging not correct
#28519
opened Nov 12, 2025 by
vnadathur
Loading…
[Misc] don't cache
CUTLASS_REVISION var in CMakeLists.txt
ci/build
nvidia
#28518
opened Nov 12, 2025 by
jinzhen-lin
Loading…
[BugFix] Ensure Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
EngineArgs.create_engine_config is idempotent
bug
#28515
opened Nov 12, 2025 by
njhill
Loading…
[Benchmark] Fix client seed synchronization in multi-turn benchmark
performance
Performance-related issues
#28512
opened Nov 12, 2025 by
ai-jz
Loading…
[XPU] Support Triton path for LoRA operations on XPU
ready
ONLY add when PR is ready to merge/full CI is needed
#28511
opened Nov 12, 2025 by
faaany
Loading…
3 of 4 tasks
[quantization][config] enable override existing quant_config
#28510
opened Nov 12, 2025 by
ILikeIneine
Loading…
5 tasks
[Bugfix] Add explicit cuBLAS linking to _C extension
ci/build
#28504
opened Nov 12, 2025 by
scottzh8
Loading…
[Misc] Add In-Container restart capability through supervisord for openai server
ci/build
#28502
opened Nov 12, 2025 by
HappyAmazonian
•
Draft
5 tasks
Add bias tests for GPT OSS and standardize bias params naming
#28499
opened Nov 11, 2025 by
cyrusd98
Loading…
[Kernel] Unified attention kernel performance tuning
#28497
opened Nov 11, 2025 by
cagrikymk
Loading…
[Bugfix] Prevent premature multimodal preprocess cache eviction
multi-modality
Related to multi-modality (#4194)
#28496
opened Nov 11, 2025 by
aykoppol
Loading…
[Performance][Hopper] Avoid M dim padding to 4x for most cases (due to cuda graphs paddings)
deepseek
Related to DeepSeek models
nvidia
ready
ONLY add when PR is ready to merge/full CI is needed
#28492
opened Nov 11, 2025 by
alexm-redhat
Loading…
[TPU] Support GCS path in VLLM_TORCH_PROFILER_DIR
ci/build
#28487
opened Nov 11, 2025 by
QiliangCui
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.