Skip to content

Pull requests: microsoft/onnxruntime-genai

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

macOS ARM64 ADO pipeline
#2091 opened Apr 18, 2026 by Copilot AI Loading…
Add Transformers v5 Support
#2089 opened Apr 17, 2026 by sayanshaw24 Collaborator Loading…
WIP: TurboQuant for ORT WebGPU
#2084 opened Apr 14, 2026 by sushraja-msft Contributor Draft
[WebGPU] Support continuous decoding (RewindTo) with graph capture
#2083 opened Apr 13, 2026 by qjia7 Contributor Loading…
extend modelbuilder to build Olmo3, SmolLM3 and other models
#2078 opened Apr 10, 2026 by xadupre Member Loading…
Add onStageComplete
#2074 opened Apr 8, 2026 by apsonawane Contributor Loading…
Enable CUDA graph capture for CUDA EP to improve decode throughput
#2070 opened Apr 7, 2026 by apsonawane Contributor Loading…
Add MIGraphX execution provider support
#2069 opened Apr 5, 2026 by aditya-dl Loading…
Fix: Win32 build failure when paths contain spaces
#2053 opened Apr 1, 2026 by nsubaru Loading…
Add HunYuan Dense V1 (hunyuan_v1_dense) model support
#2045 opened Mar 25, 2026 by amdrajeevp1 Contributor Loading…
[VitisAI] external_ep_library typo fix
#2027 opened Mar 13, 2026 by akholodnamdcom Contributor Loading…
Add Qwen3.5 support
#2025 opened Mar 13, 2026 by kinfey Contributor Loading…
[Don't review] Optimizations for graph capture
#2011 opened Mar 6, 2026 by qjia7 Contributor Draft
ProTip! Filter pull requests by the default branch with base:main.