-
-
Couldn't load subscription status.
- Fork 10.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model] fix glm4_moe_mtp load weights with GLM-4.6 checkpoint.
#27597
opened Oct 27, 2025 by
wuyaoxuehun
Loading…
[Bugfix][Frontend] validate arg priority in frontend LLM class before add request
frontend
#27596
opened Oct 27, 2025 by
junpuf
Loading…
3 tasks
Fix intermediatetensors spawn error #27591
qwen
Related to Qwen models
#27594
opened Oct 27, 2025 by
baonudesifeizhai
Loading…
5 tasks
Remove deprecated fields from Improvements or additions to documentation
llama
Related to Llama models
v1
CompilationConfig
documentation
#27593
opened Oct 27, 2025 by
hmellor
Loading…
[Stability fix] turn off HMA allocator when connector is set
ready
ONLY add when PR is ready to merge/full CI is needed
[Bug] Fix shape issue for eplb expert weights
ready
ONLY add when PR is ready to merge/full CI is needed
#27589
opened Oct 27, 2025 by
yewentao256
Loading…
[Misc] Separate out ONLY add when PR is ready to merge/full CI is needed
v1
utils.counter and move utils.Device to engine
frontend
ready
#27588
opened Oct 27, 2025 by
DarkLight1337
Loading…
5 tasks
[Build] Optimize docker layers for better caching
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#27585
opened Oct 27, 2025 by
rzabarazesh
Loading…
5 tasks
Rename clashing method names for vLLM model protocol
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
gpt-oss
Related to GPT-OSS models
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
qwen
Related to Qwen models
speculative-decoding
tpu
Related to Google TPUs
v1
#27583
opened Oct 27, 2025 by
hmellor
Loading…
[ROCm][GEMM] update aiter fp8 linear check for different platforms
rocm
Related to AMD ROCm
#27579
opened Oct 27, 2025 by
zhuyuhua-v
Loading…
[perf] Enable concurrent execution of "shared_experts" and "selected_experts"
qwen
Related to Qwen models
#27578
opened Oct 27, 2025 by
ZJY0516
Loading…
5 tasks
[Prefix Cache] Include lora_name in BlockStored event for deterministic KV-cache reconstruction
documentation
Improvements or additions to documentation
kv-connector
v1
#27577
opened Oct 27, 2025 by
sagiahrac
Loading…
[CI][Bugfix] Fix triton import check error
#27574
opened Oct 27, 2025 by
MengqingCao
•
Draft
5 tasks
[Bugfix][P/D] Fix throughput stats in disaggregated setup
kv-connector
v1
#27569
opened Oct 27, 2025 by
NickLucche
Loading…
[DeepSeek v3.2] Make top-k work for any logit values.
deepseek
Related to DeepSeek models
#27568
opened Oct 27, 2025 by
dcampora
Loading…
5 tasks
Fix a robust parsing issue in KimiK2ToolParser that causes IndexError
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#27565
opened Oct 27, 2025 by
wangln19
Loading…
3 of 10 tasks
[Misc] Replace CUDA_VISIBLE_DEVICES in DP with torch.cuda.set_device for device selection on cuda-like devices
kv-connector
v1
#27564
opened Oct 27, 2025 by
ilmarkov
Loading…
5 tasks
[Bugfix][Rocm] Fix shared expert weight loading failure in DeepSeek-MTP
deepseek
Related to DeepSeek models
rocm
Related to AMD ROCm
#27563
opened Oct 27, 2025 by
zhyajie
Loading…
[Bugfix] Validate custom logits processor xargs for online serving
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
v1
[Bugfix] fixed inconsistent finish_reason handling between V0 and V1 engines
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27555
opened Oct 27, 2025 by
chaunceyjiang
Loading…
5 tasks
[CI/Build] Test torchrun with 8 cards
ci/build
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#27548
opened Oct 27, 2025 by
22quinn
Loading…
3 of 5 tasks
[BUG] Fix hybrid kvcache kernel page size issue
v1
#27547
opened Oct 27, 2025 by
vadiklyutiy
Loading…
[Docs] add Shanghai Meetup - 2025/10
documentation
Improvements or additions to documentation
#27545
opened Oct 27, 2025 by
kebe7jun
Loading…
5 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.