Uh oh!

There was an error while loading. Please reload this page.

NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 515
Star 3.3k

Code
Issues 85
Pull requests 236
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 36 Milestones 0

New pull request New

236 Open 1,362 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[chore]: weekly bump of uv.lock on main (2026-07-27)

#2021 opened Jul 27, 2026 by github-actions Bot

Loading…

docs(puzzletron): install lm-eval in container setup

#2020 opened Jul 27, 2026 by rishiskhare

Loading…

Add unscaled E5M2 fake quantization and Slurm eval support

#2019 opened Jul 26, 2026 by ChenhanYu Collaborator • Draft

Add onnxsim as an alternative ONNX simplification backend

#2018 opened Jul 26, 2026 by take-cheeze

Loading…

fix(export): unblock AWQ export for fused MoE experts (Nemotron-H)

#2017 opened Jul 24, 2026 by cjluo-nv Collaborator • Draft

[OMNIML-5562] Add FAR3D ONNX PTQ and accuracy evaluation example

#2012 opened Jul 23, 2026 by ajrasane Contributor

Loading…

Add ModelOpt QAD skill for Slurm workflows

#2010 opened Jul 23, 2026 by mxinO Contributor • Draft

Add Dockerfile to examples/puzzletron puzzletron_v2

#2009 opened Jul 23, 2026 by grzegorz-k-karch Contributor

Loading…

Single gpu disk offload PTQ for DSR1/Ultra

#2008 opened Jul 23, 2026 by Fridah-nv Contributor

Loading…

Add Codex Day 0 agent roles

#2006 opened Jul 21, 2026 by chadvoegele Contributor • Draft

Add composable scale calibration to GPTQ

#2004 opened Jul 21, 2026 by realAsma Contributor • Draft

Add sidecar GPU/CPU memory+utilization monitor for HF PTQ

#2000 opened Jul 21, 2026 by Fridah-nv Contributor

Loading…

Speed up compressed-tensors load-time matching (for Kimi models)

#1999 opened Jul 21, 2026 by rohansjoshi Contributor

Loading…

Offline-KD QAD example

#1998 opened Jul 20, 2026 by AAnoosheh Contributor

Loading…

Add link to puzzletron_v2

#1996 opened Jul 20, 2026 by Separius Contributor

Loading…

fix(autoquant): score grouped QKV at attention output

#1993 opened Jul 19, 2026 by realAsma Contributor • Draft

Adds skip-softmax calibration through the vLLM serving path

#1992 opened Jul 19, 2026 by kaix-nv Contributor • Draft

Document non-Diffusers FP8/NVFP4 ComfyUI export

#1991 opened Jul 17, 2026 by jingyu-ml Contributor • Draft

feat(rocm): Add AMD ROCm/MI300X support — FP8 hipBLASLt, MIGraphX backend, AMD quantization configs

#1990 opened Jul 17, 2026 by zhihuidu-amd

Loading…

Make group-boundary AutoQuant scoring the default

#1988 opened Jul 17, 2026 by meenchen Contributor • Draft

Add MLflow tracking to modelopt MCP

#1986 opened Jul 17, 2026 by ChenhanYu Collaborator

Loading…

[6425069][ONNX][Autocast] Fix autocast metadata propagation

#1983 opened Jul 16, 2026 by gcunhase Contributor

Loading…

[5726458] Add NVFP4 projection-output-quantizer recipe and HF embedding ONNX export example

#1981 opened Jul 16, 2026 by ajrasane Contributor

Loading…

Scripts and a skill to do per-layer benchmark using flashinfer

#1980 opened Jul 16, 2026 by sychen52 Contributor

Loading…

docs: add AutoQuantize mixed-precision search blog

#1979 opened Jul 15, 2026 by realAsma Contributor • Draft

Previous 1 2 3 4 5 … 9 10 Next

Previous Next

ProTip! Exclude everything labeled bug with -label:bug.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!