-
-
Notifications
You must be signed in to change notification settings - Fork 10.7k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Mirroring changes in test-pipeline.yaml into test-amd.yaml
ci/build
rocm
Related to AMD ROCm
#27242
opened Oct 21, 2025 by
Alexei-V-Ivanov-AMD
Loading…
[Bug] Qwen reasoning parser
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#27241
opened Oct 21, 2025 by
ahao-anyscale
•
Draft
5 tasks
v1/kv_cache_utils: Respect num_gpu_blocks_override in memory check
v1
#27238
opened Oct 21, 2025 by
khaled-wsa
Loading…
[CI/Testing] Add basic single node dual batch overlap test
ci/build
v1
#27235
opened Oct 21, 2025 by
LucasWilkinson
Loading…
[ResponseAPI] Fix mcp tool type extraction
frontend
gpt-oss
Related to GPT-OSS models
#27234
opened Oct 21, 2025 by
Jialin
Loading…
3 of 5 tasks
[Bugfix] Ensure calculated KV scales are applied in attention.
v1
#27232
opened Oct 21, 2025 by
adabeyta
Loading…
Adds runai distributed streamer
ci/build
documentation
Improvements or additions to documentation
rocm
Related to AMD ROCm
#27230
opened Oct 20, 2025 by
bbartels
Loading…
5 tasks
[Feature] Batch Invariant for R1 TP 8 on Blackwell
ready
ONLY add when PR is ready to merge/full CI is needed
#27229
opened Oct 20, 2025 by
yewentao256
Loading…
[Bugfix] Fix broken MTP weight loading for FP8 KV Scales
bug
Something isn't working
deepseek
Related to DeepSeek models
ready
ONLY add when PR is ready to merge/full CI is needed
#27227
opened Oct 20, 2025 by
benchislett
Loading…
[ROCm][MLA] Support block-size > 1 for AITER MLA backend
rocm
Related to AMD ROCm
v1
#27224
opened Oct 20, 2025 by
ganyi1996ppo
Loading…
5 tasks
Flashinfer_CUTLASS_MOE fuses quantization for TP
#27223
opened Oct 20, 2025 by
wenscarl
Loading…
5 tasks
ARM64 CUDA 12.9 wheels built and uploaded to index incorrectly
ci/build
#27221
opened Oct 20, 2025 by
Gregory-Pereira
Loading…
[Bugfix] Fix dp_chunking enablement logic in FusedMoE layer
#27220
opened Oct 20, 2025 by
alexm-redhat
Loading…
[CORE] Support Prefix Caching with Prompt Embeds
documentation
Improvements or additions to documentation
v1
#27219
opened Oct 20, 2025 by
qthequartermasterman
Loading…
3 of 5 tasks
[Backend][WIP] Integrate MPK (Mirage) compiler as an experimental execution backend to vLLM
v1
#27218
opened Oct 20, 2025 by
NorthmanPKU
•
Draft
1 of 8 tasks
[Kernel] Re-enable mrope triton kernel for CUDA/ROCM platform by default
rocm
Related to AMD ROCm
#27216
opened Oct 20, 2025 by
Isotr0py
Loading…
1 of 5 tasks
Add @pavanimajety to .github/codeowners
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#27213
opened Oct 20, 2025 by
pavanimajety
Loading…
5 tasks
[Prefix Cache] Use LoRA name for consistent KV-cache block hashing
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27211
opened Oct 20, 2025 by
sagiahrac
Loading…
[Chore] Separate out optional dependency checks from vllm.utils
documentation
Improvements or additions to documentation
gpt-oss
Related to GPT-OSS models
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27207
opened Oct 20, 2025 by
dongbo910220
Loading…
[Frontend] Require flag for loading text and image embeds
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
v1
[Kernel] Support attention sinks in vLLM
ci/build
v1
#27203
opened Oct 20, 2025 by
dudugong-gitch
Loading…
3 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.