Pull requests: microsoft/onnxruntime

Pull requests list

[WebNN] Fix some spelling and naming issues
#25433 opened Jul 17, 2025 by Honry
[CUDA] Support head_sink in flash attention for GQA
#25432 opened Jul 17, 2025 by tianleiwu
2 tasks done
Fix issue with negative dynamic tensor shape
#25431 opened Jul 17, 2025 by bachelor-dou
Bump transformers from 4.48.0 to 4.52.1 in /tools/ci_build/requirements/transformers-test (labels: dependencies, python)
#25429 opened Jul 16, 2025 by dependabot bot
Subgroup matrix
#25416 opened Jul 16, 2025 by xiaofeihan1 Draft
add webgpu support for GatherBlockQuantized (label: ep:WebGPU)
#25413 opened Jul 15, 2025 by guschmue
Fix the is_leaf check in TreeEnsemble
#25410 opened Jul 15, 2025 by meakbiyik
[TRT-EP] Add loadModelProto APIs (label: release:1.23.0)
#25409 opened Jul 15, 2025 by kevinch-nv
fix shape inference error for ep context nodes
#25398 opened Jul 15, 2025 by wcy123
[webgpu] Optimize FlashAttention for prefill
#25395 opened Jul 15, 2025 by daijh
[webgpu] use u32 to represent f16 in uniform
#25391 opened Jul 14, 2025 by fs-eire
Optimize layout for SubgroupMatrixLoad on Intel (label: ep:WebGPU)
#25384 opened Jul 14, 2025 by jchen10
add sliding window support for webgpu gqa (label: ep:WebGPU)
#25372 opened Jul 11, 2025 by guschmue