Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: PTQ 1GPU, export PP divisibility, hidden states conversations key
#1293 opened Apr 18, 2026 by ChenhanYu Collaborator Draft
3 tasks done
add gptq fused kernel
#1291 opened Apr 17, 2026 by sychen52 Contributor Loading…
Add FP8 MHA quantization support for HuggingFace ViT
#1289 opened Apr 17, 2026 by ajrasane Contributor Loading…
4 tasks
keep deploy cases and Eagle fixes for merge
#1287 opened Apr 17, 2026 by nvSiruiW Loading…
Update excluded modules for Qwen3.5 dense PTQ
#1284 opened Apr 17, 2026 by amukkara Loading…
[minor] Add custom calibration backend registry
#1281 opened Apr 16, 2026 by Fridah-nv Contributor Loading…
Add qwen3 moe experts only test
#1274 opened Apr 16, 2026 by cjluo-nv Collaborator Loading…
SpecDec Bench: April Update
#1272 opened Apr 16, 2026 by IzzyPutterman Contributor Loading…
Skip Softmax diffusion export
#1269 opened Apr 15, 2026 by jingyu-ml Contributor Loading…
Centralize 'trtexec' subprocess runs in ONNX into a single function
#1268 opened Apr 15, 2026 by gcunhase Contributor Loading…
Exclude small-k and small-n Matmul nodes from Int8 quantization
#1256 opened Apr 14, 2026 by nv-samcheng Contributor Loading…
Add EfficientViT support for torch_onnx quantization workflow
#1254 opened Apr 14, 2026 by ajrasane Contributor Loading…
3 tasks done
fix(launcher): use afterany dependency for allow_to_fail pipelines
#1248 opened Apr 13, 2026 by yeyu-nvidia Contributor Loading…
3 tasks
Add LAQ (Learnable Amax Quantization) algorithm
#1247 opened Apr 13, 2026 by realAsma Contributor Loading…
4 tasks
vLLM fakequant export update for AWQ checkpoint
#1242 opened Apr 13, 2026 by kinjalpatel27 Contributor Loading…
support Qwen3.5 quantization
#1230 opened Apr 10, 2026 by deepindeed2022 Loading…
[2/3] Implicit Gemm NVFP4
#1227 opened Apr 9, 2026 by jingyu-ml Contributor Loading…
Add Gemma4 MoE quantization support
#1219 opened Apr 9, 2026 by yueshen2016 Contributor Loading…
4 tasks done
Add WaterSIC for KV-cache quantization
#1217 opened Apr 9, 2026 by kaix-nv Contributor Draft
ProTip! Mix and match filters to narrow down what you’re looking for.