NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 361
Star 2.5k

Code
Issues 56
Pull requests 127
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 31 Milestones 0

New pull request New

127 Open 816 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix: PTQ 1GPU, export PP divisibility, hidden states conversations key

#1293 opened Apr 18, 2026 by ChenhanYu Collaborator • Draft

3 tasks done

add gptq fused kernel

#1291 opened Apr 17, 2026 by sychen52 Contributor

Loading…

Add FP8 MHA quantization support for HuggingFace ViT

#1289 opened Apr 17, 2026 by ajrasane Contributor

Loading…

4 tasks

keep deploy cases and Eagle fixes for merge

#1287 opened Apr 17, 2026 by nvSiruiW

Loading…

Update excluded modules for Qwen3.5 dense PTQ

#1284 opened Apr 17, 2026 by amukkara

Loading…

[minor] Add custom calibration backend registry

#1281 opened Apr 16, 2026 by Fridah-nv Contributor

Loading…

Add qwen3 moe experts only test

#1274 opened Apr 16, 2026 by cjluo-nv Collaborator

Loading…

SpecDec Bench: April Update

#1272 opened Apr 16, 2026 by IzzyPutterman Contributor

Loading…

[Feat,Refactor]: Offline Dflash; Spec Mixin; Deprecate parallel draft;

#1271 opened Apr 16, 2026 by h-guo18 Contributor

Loading…

Skip Softmax diffusion export

#1269 opened Apr 15, 2026 by jingyu-ml Contributor

Loading…

Centralize 'trtexec' subprocess runs in ONNX into a single function

#1268 opened Apr 15, 2026 by gcunhase Contributor

Loading…

Handle zero-amax per-channel activation scaling for MoE export

#1265 opened Apr 15, 2026 by AEON-7

Loading…

Fix non-scalar input amax in preprocess_linear_fusion for MoE export

#1264 opened Apr 15, 2026 by AEON-7

Loading…

Exclude small-k and small-n Matmul nodes from Int8 quantization

#1256 opened Apr 14, 2026 by nv-samcheng Contributor

Loading…

Add EfficientViT support for torch_onnx quantization workflow

#1254 opened Apr 14, 2026 by ajrasane Contributor

Loading…

3 tasks done

Add a general composable $import system for YAML configs, and use it to implement composable recipes

#1253 opened Apr 14, 2026 by shengliangxu Collaborator

Loading…

fix(launcher): use afterany dependency for allow_to_fail pipelines

#1248 opened Apr 13, 2026 by yeyu-nvidia Contributor

Loading…

3 tasks

Add LAQ (Learnable Amax Quantization) algorithm

#1247 opened Apr 13, 2026 by realAsma Contributor

Loading…

4 tasks

vLLM fakequant export update for AWQ checkpoint

#1242 opened Apr 13, 2026 by kinjalpatel27 Contributor

Loading…

feat: parallelize fakequant export across GPUs via ThreadPoolExecutor

#1241 opened Apr 13, 2026 by kinjalpatel27 Contributor

Loading…

[1/N] Polish evaluation skills and common skills based on an E2E workflow testing, vendor two Claude skills from NeMo Evaluator

#1239 opened Apr 12, 2026 by Edwardf0t1 Contributor

Loading…

support Qwen3.5 quantization

#1230 opened Apr 10, 2026 by deepindeed2022

Loading…

[2/3] Implicit Gemm NVFP4

#1227 opened Apr 9, 2026 by jingyu-ml Contributor

Loading…

Add Gemma4 MoE quantization support

#1219 opened Apr 9, 2026 by yueshen2016 Contributor

Loading…

4 tasks done

Add WaterSIC for KV-cache quantization

#1217 opened Apr 9, 2026 by kaix-nv Contributor • Draft

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!