Skip to content

Pull requests: ROCm/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Triton] 355 wip fused triton
#666 opened Sep 10, 2025 by k50112113 Loading…
support rocblas for rocm_unquantized_gemm
#665 opened Sep 10, 2025 by eliotwang Loading…
[355_wip] Add in-place rms_norm kernels and their quant fusions.
#661 opened Sep 9, 2025 by xytpai Loading…
3 tasks done
Add cache config for gpt oss
#656 opened Sep 5, 2025 by cagrikymk Draft
fix flashmla metadata build calls()
#636 opened Aug 19, 2025 by ZJLi2013 Loading…
[Model] Add GPT-OSS model code and config
#625 opened Aug 7, 2025 by ashishtanwer Loading…
add Fused_rms_quant for deepseek_v2 model
#611 opened Jul 29, 2025 by ZJLi2013 Loading…
[FEAT] [ROCm] Shared Experts Aiter
#605 opened Jul 25, 2025 by tjtanaavllm Loading…
add fused fp8 bmm
#604 opened Jul 25, 2025 by k50112113 Loading…
Update fp8 paged attention
#592 opened Jul 9, 2025 by amd-xiaoyu12 Draft
Update test-template.j2
#579 opened Jun 16, 2025 by okakarpa Loading…
Disable skynny gemms by default unstale
#568 opened Jun 5, 2025 by k-artem Loading…
Test Queues
#456 opened Feb 28, 2025 by dhonnappa-amd Draft
ProTip! Follow long discussions with comments:>50.