Pull requests: vllm-project/vllm

Pull requests list

Fix intermediatetensors spawn error #27591
Labels: qwen
#27594 opened Oct 27, 2025 by baonudesifeizhai
5 tasks
Remove deprecated fields from CompilationConfig
Labels: documentation, llama, v1
#27593 opened Oct 27, 2025 by hmellor
[Stability fix] turn off HMA allocator when connector is set
Labels: ready
#27592 opened Oct 27, 2025 by KuntaiDu
5 tasks
Milestone: v0.11.1
[Bug] Fix shape issue for eplb expert weights
Labels: ready
#27589 opened Oct 27, 2025 by yewentao256
[Misc] Separate out utils.counter and move utils.Device to engine
Labels: frontend, ready, v1
#27588 opened Oct 27, 2025 by DarkLight1337
5 tasks
[Build] Optimize docker layers for better caching
Labels: ci/build, ready
#27585 opened Oct 27, 2025 by rzabarazesh
5 tasks
Rename clashing method names for vLLM model protocol
Labels: deepseek, documentation, gpt-oss, llama, multi-modality, qwen, speculative-decoding, tpu, v1
#27583 opened Oct 27, 2025 by hmellor
[ROCm][GEMM] update aiter fp8 linear check for different platforms
Labels: rocm
#27579 opened Oct 27, 2025 by zhuyuhua-v
[perf] Enable concurrent execution of "shared_experts" and "selected_experts"
Labels: qwen
#27578 opened Oct 27, 2025 by ZJY0516
5 tasks
[CI][Bugfix] Fix triton import check error
#27574 opened Oct 27, 2025 by MengqingCao Draft
5 tasks
[DeepSeek v3.2] Make top-k work for any logit values.
Labels: deepseek
#27568 opened Oct 27, 2025 by dcampora
5 tasks
Fix a robust parsing issue in KimiK2ToolParser that causes IndexError
Labels: frontend, ready, tool-calling
#27565 opened Oct 27, 2025 by wangln19
3 of 10 tasks
[Bugfix][Rocm] Fix shared expert weight loading failure in DeepSeek-MTP
Labels: deepseek, rocm
#27563 opened Oct 27, 2025 by zhyajie
[Bugfix] Validate custom logits processor xargs for online serving
Labels: deepseek, documentation, frontend, ready, v1
#27560 opened Oct 27, 2025 by Isotr0py Draft
1 of 5 tasks
[Bugfix] fixed inconsistent finish_reason handling between V0 and V1 engines
Labels: ready, v1
#27555 opened Oct 27, 2025 by chaunceyjiang
5 tasks
covt_e4m3_bf16
#27553 opened Oct 27, 2025 by wangyxbh
5 tasks
[CI/Build] Test torchrun with 8 cards
Labels: ci/build, documentation, ready
#27548 opened Oct 27, 2025 by 22quinn
3 of 5 tasks
[Docs] add Shanghai Meetup - 2025/10
Labels: documentation
#27545 opened Oct 27, 2025 by kebe7jun
5 tasks