-
Notifications
You must be signed in to change notification settings - Fork 337
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Replace in-repo LLM ONNX export with TensorRT-Edge-LLM
#1210
opened Apr 8, 2026 by
ajrasane
Loading…
fix: megatron export correctness for TP>1 GQA, single-file MTP, and Hub remote code
cherry-pick
After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates
#1209
opened Apr 8, 2026 by
kevalmorabia97
Loading…
Consolidate lm-eval scripts: merge AnyModel auto-detection into lm_eval_hf.py
#1206
opened Apr 8, 2026 by
j-rausch
Loading…
Add Z-Image (NextDiT/Lumina2) PTQ quantization support in diffusers example
#1205
opened Apr 8, 2026 by
andrea-pilzer
Loading…
Add support for postprocess exported model for block scale swizzling and support for different padding strategy
#1195
opened Apr 8, 2026 by
ynankani
Loading…
fix: handle accelerate CPU-offloaded models in FakeQuant export
#1194
opened Apr 8, 2026 by
sungsooha
Loading…
Validate non-empty cfg when enabling quantizers in quant_cfg
#1192
opened Apr 7, 2026 by
shengliangxu
Loading…
Simplify KDTrainer and enhance ModelOptHFTrainer
#1191
opened Apr 7, 2026 by
realAsma
Loading…
4 of 6 tasks
Add ModelOpt Triton attention kernels for WAN2.2 diffusion (sparse, skip-softmax, NVFP4)
#1190
opened Apr 7, 2026 by
yeyu-nvidia
Loading…
5 tasks
Generic Fused MoE Quantization + Export for transformers 5.0+
#1187
opened Apr 7, 2026 by
Edwardf0t1
Loading…
2 of 3 tasks
[chore]: weekly bump of uv.lock on main (2026-04-06)
#1180
opened Apr 6, 2026 by
github-actions
bot
Loading…
feat: parallelize fakequant export across GPUs via ThreadPoolExecutor
#1177
opened Apr 3, 2026 by
sungsooha
Loading…
[1/N] Refactor llm_qat example: YAML configs + ModelOptArgParser
#1172
opened Apr 2, 2026 by
realAsma
Loading…
3 of 4 tasks
[minor] add a general FP8ScaleSweepCalibrator and its registry
#1171
opened Apr 2, 2026 by
Fridah-nv
Loading…
Refactor Qwen3.5 MoE quantization to use _QuantFunctionalMixin
#1170
opened Apr 2, 2026 by
cjluo-nv
Loading…
4 tasks
[NVBug 6045859]Fix export support for Qwen3VL MoE experts
#1164
opened Apr 1, 2026 by
shengliangxu
Loading…
Fix[bug] ONNX models generated by llm_export.py are missing some i/o
#1157
opened Apr 1, 2026 by
Ratheesh1104
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-03-08.