Pull requests: microsoft/onnxruntime
Pull requests list
[CUDA] Support head_sink in flash attention for GQA
#25432
opened Jul 17, 2025 by
tianleiwu
2 tasks done
Bump transformers from 4.48.0 to 4.52.1 in /tools/ci_build/requirements/transformers-test
dependencies
python
#25429
opened Jul 16, 2025 by
dependabot
bot
Enable free dimension override for graph optimization level 0
release:1.23.0
#25425
opened Jul 16, 2025 by
chilo-ms
[EP ABI] Add Graph_GetModelPath API function
release:1.23.0
#25424
opened Jul 16, 2025 by
adrianlizarraga
add webgpu support for GatherBlockQuantized
ep:WebGPU
#25413
opened Jul 15, 2025 by
guschmue
[EP ABI] Add documentation for OrtValue and ort_graph_to_proto util
release:1.23.0
#25411
opened Jul 15, 2025 by
adrianlizarraga
'QnnEpFactory' should provide a fully-qualified path to the backend
release:1.23.0
#25407
opened Jul 15, 2025 by
mschofie
Update fusion_attention to properly convert bfloat16 values
#25404
opened Jul 15, 2025 by
justinchuby
[WebGPU EP] allow concat operator to handle large number of inputs
#25390
opened Jul 14, 2025 by
prathikr
Update the customer op API ReadOpAttr for string type to avoid adding '\0' to the end
release:1.23.0
#25389
opened Jul 14, 2025 by
HectorSVC
Optimize layout for SubgroupMatrixLoad on Intel
ep:WebGPU
#25384
opened Jul 14, 2025 by
jchen10
add sliding window support for webgpu gqa
ep:WebGPU
#25372
opened Jul 11, 2025 by
guschmue
[NV RTX EP] Upstream changes from the win-ort
release:1.23.0
#25370
opened Jul 11, 2025 by
ishwar-raut1