-
Notifications
You must be signed in to change notification settings - Fork 4k
Pull requests: microsoft/onnxruntime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add INT4 per-channel MoE GEMV fast path for batch-1 decode
#29038
opened Jun 13, 2026 by
tianleiwu
Contributor
Loading…
Bump esbuild, @vitejs/plugin-vue and vite in /js/web/test/e2e/exports/testcases/vite-default
dependencies
Pull requests that update a dependency file
javascript
Pull requests that update Javascript code
#29036
opened Jun 13, 2026 by
dependabot
Bot
Loading…
[web] Fix use-after-free of model buffer with use_ort_model_bytes_for_initializers
#29033
opened Jun 12, 2026 by
yrliou
Loading…
Plugin WebGPU EP Branch: Update protobufjs versions
#29031
opened Jun 12, 2026 by
adrastogi
Contributor
Loading…
Fix WebGPU GatherBlockQuantized dispatch failure for empty indices
#29030
opened Jun 12, 2026 by
tairenpiao
Contributor
Loading…
[WebNN EP] Remove unnecessary cast around normalization ops
#29024
opened Jun 12, 2026 by
Honry
Contributor
Loading…
Fix
ToOrtStatus translation of SYSTEM-category Status error codes
#29023
opened Jun 12, 2026 by
edgchen1
Contributor
Loading…
[WebGPU] Fix profiling timestamp alignment with ORT profiler
#29021
opened Jun 12, 2026 by
daijh
Contributor
Loading…
Bump joi and react-native in /js/react_native/e2e
dependencies
Pull requests that update a dependency file
javascript
Pull requests that update Javascript code
#29019
opened Jun 11, 2026 by
dependabot
Bot
Loading…
[js/web] Forward WebGPU EP buffer cache mode options from JS
#29017
opened Jun 11, 2026 by
ssam18
Contributor
Loading…
Fix libatomic linking on toolchains that default to ld --as-needed
#29008
opened Jun 11, 2026 by
ssam18
Contributor
Loading…
Remove unimodule.json so Expo autolinks onnxruntime-react-native
#29005
opened Jun 11, 2026 by
danielweinmann
•
Draft
webgpu: fix GQA batched right-padded prefill with do_rotary
#29002
opened Jun 11, 2026 by
qjia7
Contributor
Loading…
7 tasks done
Bump torch from 2.7.0 to 2.12.0 in /onnxruntime/python/tools/transformers/models/whisper
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
#28993
opened Jun 10, 2026 by
dependabot
Bot
Loading…
[CUDA] Add decode (M=1) GEMV fast path to MatMul
#28986
opened Jun 10, 2026 by
tianleiwu
Contributor
Loading…
[CUDA] Add decode-optimized LinearAttention (GatedDeltaNet) kernels
#28985
opened Jun 10, 2026 by
tianleiwu
Contributor
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-10.