Pull requests: NVIDIA/Model-Optimizer

Pull requests list

docs: add modelopt_recipes README and PTQ recipe/scheme guide
#1662 opened Jun 9, 2026 by cjluo-nv Collaborator
[6241485] Add support for ONNX Q/DQ node placement for DLA
#1661 opened Jun 9, 2026 by gcunhase Contributor
feat(deepseek): add --cast_mxfp4_to_nvfp4 to deepseek_v4 quantize step (label: cherry-pick-0.45.0)
#1653 opened Jun 8, 2026 by cjluo-nv Collaborator
docs(eval skill): drop arch-specific cu130 nightly tag (release images are multi-arch)
#1649 opened Jun 8, 2026 by cjluo-nv Collaborator
4 of 5 tasks
Support MCore auto_quantize calibration updates
#1639 opened Jun 5, 2026 by realAsma Contributor Draft
Add NVFP4 fakequant for attention BMMs
#1635 opened Jun 5, 2026 by kaix-nv Contributor Draft
fix(export): correct unified_export_megatron at EP > 1 and DP > 1
#1631 opened Jun 4, 2026 by yueshen2016 Contributor
3 of 4 tasks
[6058841] Consistent types on If/Loop/Scan subgraphs during FP16/BF16 conversion (label: cherry-pick-0.45.0)
#1628 opened Jun 4, 2026 by ajrasane Contributor Draft
[6058907] Fix ShapeInferenceError in ONNX int8+fp16 quantization of weakly-typed models (label: cherry-pick-0.45.0)
#1627 opened Jun 4, 2026 by ajrasane Contributor
Skip-Softmax calibration in vLLM
#1622 opened Jun 3, 2026 by kaix-nv Contributor Draft
Add W4A16 NVFP4-MSE Qwen3.5 dense/MoE PTQ recipes
#1620 opened Jun 3, 2026 by cjluo-nv Collaborator
[Feat]: Specdec Streaming: RDMA + Multinode
#1611 opened Jun 2, 2026 by h-guo18 Contributor