Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[https://nvbugs/6225775][fix] Capture count penalty in CUDA graphs
#15216 opened Jun 10, 2026 by chuangz0 Collaborator Draft
1 task done
[#15178][fix] Fix unified-memory Mamba KV estimation
#15215 opened Jun 10, 2026 by peter941221
1 task done
[https://nvbugs/6245279][fix] AutoDeploy: Unwaive accuracy tests
#15214 opened Jun 10, 2026 by galagam Collaborator
1 task done
[None][chore] Fix lock_infra_error
#15213 opened Jun 10, 2026 by yufeiwu-nv Collaborator
1 task done
[https://nvbugs/6225775][fix] Fix spec count graph
#15212 opened Jun 10, 2026 by chuangz0 Collaborator
1 task
[None][perf] executor: memcpy int32 token buffer into tle::Request ctor
#15211 opened Jun 10, 2026 by hyukn Collaborator Draft
3 tasks done
[None][infra] Record CBTS decision to OpenSearch for CI-health monitoring
#15210 opened Jun 10, 2026 by crazydemo Collaborator
1 task done
[TRTLLM-13104][feat] Use checkpoint MTP kernel for mamba
#15209 opened Jun 10, 2026 by Wanli-Jiang Collaborator
1 task done
[TRTLLM-11408][test] Add e2e Tensor Parallel LPIPS tests for VisualGen
#15208 opened Jun 10, 2026 by yingguo-trt Collaborator
1 task done
Codex/dynamo router trtllm
#15207 opened Jun 10, 2026 by reasonsolo Collaborator Draft
1 task
[https://nvbugs/6094068][fix] Fix Qwen3-Next bf16 4gpu test
#15206 opened Jun 10, 2026 by JadoTu Collaborator
1 task done
[TRTLLM-12807][feat] Add multiple FMHA library support to TRTLLM attention backend
#15204 opened Jun 10, 2026 by yuxianq Collaborator
1 task done
[TRTLLM-35882][feat] cute dsl gvr-top multi-cta optimization
#15198 opened Jun 10, 2026 by limin2021 Collaborator
1 task done
[None][fix] Support V2 KV cache beam search
#15197 opened Jun 10, 2026 by yizhang-nv Member Draft
1 task done
[TRTLLM-13349][perf] Fuse gemma RMSNorm into AllReduce for Qwen3-Next/Qwen3.5…
#15194 opened Jun 10, 2026 by nv-guomingz Collaborator
1 task done
[None][test] Add more error signature for stress test
#15193 opened Jun 10, 2026 by fredricz-20070104 Collaborator
[#15191][doc] Fix broken AutoDeploy README link
#15192 opened Jun 10, 2026 by peter941221
1 task done