Pull requests: vllm-project/vllm

Pull requests list

[CPU]Improve cpu fused moe perf
#27244 opened Oct 21, 2025 by xiangze-arm
[Bug] Qwen reasoning parser
Labels: qwen, ready
#27241 opened Oct 21, 2025 by ahao-anyscale Draft
5 tasks
[CPU]Improve dynamic 4bit moe performance
#27240 opened Oct 21, 2025 by xiangze-arm
[ResponseAPI] Fix mcp tool type extraction
Labels: frontend, gpt-oss
#27234 opened Oct 21, 2025 by Jialin
3 of 5 tasks
Adds runai distributed streamer
Labels: ci/build, documentation, rocm
#27230 opened Oct 20, 2025 by bbartels
5 tasks
[Feature] Batch Invariant for R1 TP 8 on Blackwell
Labels: ready
#27229 opened Oct 20, 2025 by yewentao256
[Bugfix] Fix broken MTP weight loading for FP8 KV Scales
Labels: bug, deepseek, ready
#27227 opened Oct 20, 2025 by benchislett
[ROCm][MLA] Support block-size > 1 for AITER MLA backend
Labels: rocm, v1
#27224 opened Oct 20, 2025 by ganyi1996ppo
5 tasks
Flashinfer_CUTLASS_MOE fuses quantization for TP
#27223 opened Oct 20, 2025 by wenscarl
5 tasks
[CORE] Support Prefix Caching with Prompt Embeds
Labels: documentation, v1
#27219 opened Oct 20, 2025 by qthequartermasterman
3 of 5 tasks
[Kernel] Re-enable mrope triton kernel for CUDA/ROCM platform by default
Labels: rocm
#27216 opened Oct 20, 2025 by Isotr0py
1 of 5 tasks
[MXFP4] CT Integration Support
#27214 opened Oct 20, 2025 by dsikka Draft
Add @pavanimajety to .github/codeowners
Labels: ci/build, ready
#27213 opened Oct 20, 2025 by pavanimajety
5 tasks
[Prefix Cache] Use LoRA name for consistent KV-cache block hashing
Labels: ready, v1
#27211 opened Oct 20, 2025 by sagiahrac
[Chore] Separate out optional dependency checks from vllm.utils
Labels: documentation, gpt-oss, performance, ready, v1
#27207 opened Oct 20, 2025 by dongbo910220
[ROCm] Update Triton, Torch, and AITER branches for ROCm base Dockerfile
Labels: ci/build, ready, rocm
#27206 opened Oct 20, 2025 by micah-wil
[Frontend] Require flag for loading text and image embeds
Labels: documentation, frontend, multi-modality, qwen, ready, v1
#27204 opened Oct 20, 2025 by russellb
Milestone: v0.11.1
[Kernel] Support attention sinks in vLLM
Labels: ci/build, v1
#27203 opened Oct 20, 2025 by dudugong-gitch
3 tasks