Pull requests: NVIDIA/Model-Optimizer
Pull requests list
docs: add modelopt_recipes README and PTQ recipe/scheme guide
#1662
opened Jun 9, 2026 by
cjluo-nv
Collaborator
[6241485] Add support for ONNX Q/DQ node placement for DLA
#1661
opened Jun 9, 2026 by
gcunhase
Contributor
Add NVFP4 + QAD to the Nemotron-3-Nano-30B-A3B tutorial
#1660
opened Jun 9, 2026 by
kevalmorabia97
Collaborator
Draft
Add fused Triton kernel for local-Hessian NVFP4 weight-scale search
#1659
opened Jun 9, 2026 by
Fridah-nv
Contributor
docs(deployment skill): drop wrong "release predates arch" cu130 fallback
#1654
opened Jun 8, 2026 by
Edwardf0t1
Contributor
feat(deepseek): add --cast_mxfp4_to_nvfp4 to deepseek_v4 quantize step
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1653
opened Jun 8, 2026 by
cjluo-nv
Collaborator
docs(eval skill): drop arch-specific cu130 nightly tag (release images are multi-arch)
#1649
opened Jun 8, 2026 by
cjluo-nv
Collaborator
4 of 5 tasks
[OMNIML-4944] peft: add lora_dtype field to PEFTAttributeConfig
#1646
opened Jun 8, 2026 by
hychiang-git
Contributor
Draft
[OMNIML-4962] specdec_bench cell t0_d3 — Qwen/Qwen3.5-4B / DFlash / vLLM
#1638
opened Jun 5, 2026 by
ChenhanYu
Collaborator
fix(export): correct unified_export_megatron at EP > 1 and DP > 1
#1631
opened Jun 4, 2026 by
yueshen2016
Contributor
3 of 4 tasks
[6058841] Consistent types on If/Loop/Scan subgraphs during FP16/BF16 conversion
cherry-pick-0.45.0
[6058907] Fix ShapeInferenceError in ONNX int8+fp16 quantization of weakly-typed models
cherry-pick-0.45.0
#1627
opened Jun 4, 2026 by
ajrasane
Contributor
DFlash speculative decoding for MiniMax-M2.7 (FSDP2): auto mask-token, FSDP2 resume fixes, per-checkpoint draft export
#1621
opened Jun 3, 2026 by
yeyu-nvidia
Contributor
Add W4A16 NVFP4-MSE Qwen3.5 dense/MoE PTQ recipes
#1620
opened Jun 3, 2026 by
cjluo-nv
Collaborator
[Feat]: Specdec Streaming: RDMA + Multinode
#1611
opened Jun 2, 2026 by
h-guo18
Contributor
Fix torch import error to remove circular dependency & move Nemotron configs
#1606
opened Jun 2, 2026 by
jenchen13
Contributor