Pull requests: microsoft/onnxruntime
Pull requests list
[QNN-EP] Add documentation for optrace profiling
#26348 opened Oct 18, 2025 by quic-calvnguy
[build] disable DAWN_FETCH_DEPENDENCIES when using custom dawn src path
#26346 opened Oct 17, 2025 by fs-eire
Save much memory at model loading time by converting weights to OrtValues early
#26345 opened Oct 17, 2025 by yuslepukhin
[webgpu] Fused GeneratePositionIDs into FusedQKRotaryEmbedding
ep:WebGPU
ort-web webgpu provider
#26335 opened Oct 17, 2025 by xiaofeihan1
[webgpu] Don't use num_workgroups when use indirect dispatch
#26334 opened Oct 17, 2025 by qjia7
[QNN EP] Fuse Gelu pattern into a QNN Gelu Node
#26332 opened Oct 17, 2025 by quic-tirupath
[QNN-EP] Implement ABI methods for model compatibility
#26331 opened Oct 16, 2025 by quic-calvnguy • Draft
[WebGPU] allows user to specify high-performance or low-power preference
#26326 opened Oct 16, 2025 by prathikr
feat: Add BFloat16 support for Gemm and MatMul CPU operators
#26317 opened Oct 15, 2025 by snnn
Address MLAS NchwcBlockSize for AMD64 platforms behavior in the minimal builds
#26306 opened Oct 14, 2025 by yuslepukhin
Allow present_key to be empty when past_key is provided in Attention
#26303 opened Oct 14, 2025 by justinchuby