-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[https://nvbugs/6293015][fix] Add a delegating `@property def vocab_size_padded(self) -> int: return…
#15219
opened Jun 10, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][fix] Revert "Merge Eagle3 and MTP-eagle one-model workers (#12353)"
#15217
opened Jun 10, 2026 by
tensorrt-cicd
Collaborator
Loading…
[#15178][fix] Fix unified-memory Mamba KV estimation
#15215
opened Jun 10, 2026 by
peter941221
Loading…
1 task done
[https://nvbugs/6245279][fix] AutoDeploy: Unwaive accuracy tests
#15214
opened Jun 10, 2026 by
galagam
Collaborator
Loading…
1 task done
[None][chore] Fix lock_infra_error
#15213
opened Jun 10, 2026 by
yufeiwu-nv
Collaborator
Loading…
1 task done
[https://nvbugs/6225775][fix] Fix spec count graph
#15212
opened Jun 10, 2026 by
chuangz0
Collaborator
Loading…
1 task
[None][infra] Record CBTS decision to OpenSearch for CI-health monitoring
#15210
opened Jun 10, 2026 by
crazydemo
Collaborator
Loading…
1 task done
[TRTLLM-13104][feat] Use checkpoint MTP kernel for mamba
#15209
opened Jun 10, 2026 by
Wanli-Jiang
Collaborator
Loading…
1 task done
[TRTLLM-11408][test] Add e2e Tensor Parallel LPIPS tests for VisualGen
#15208
opened Jun 10, 2026 by
yingguo-trt
Collaborator
Loading…
1 task done
[https://nvbugs/6094068][fix] Fix Qwen3-Next bf16 4gpu test
#15206
opened Jun 10, 2026 by
JadoTu
Collaborator
Loading…
1 task done
[TRTLLM-12807][feat] Add multiple FMHA library support to TRTLLM attention backend
#15204
opened Jun 10, 2026 by
yuxianq
Collaborator
Loading…
1 task done
[https://nvbugs/6287721][fix] After creating the default placement group, block on `ray.get(pg.ready()…
#15203
opened Jun 10, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][feat] support Mamba state snapshots for block reuse
#15202
opened Jun 10, 2026 by
VALLIS-NERIA
Collaborator
•
Draft
[None][perf] CuTeDSL MegaMoE: eliminate per-launch workspace memset o…
#15201
opened Jun 10, 2026 by
Barry-Delaney
Collaborator
•
Draft
1 task
[https://nvbugs/6292661][fix] Add a
_ray_init_local_with_retry static helper to RayExecutor (3 attempts…
#15200
opened Jun 10, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[TRTLLM-35882][feat] cute dsl gvr-top multi-cta optimization
#15198
opened Jun 10, 2026 by
limin2021
Collaborator
Loading…
1 task done
[None][fix] Support V2 KV cache beam search
#15197
opened Jun 10, 2026 by
yizhang-nv
Member
•
Draft
1 task done
[https://nvbugs/6290962][fix] Replace the missing
pipeline.model_config accessor with…
#15196
opened Jun 10, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[TRTLLM-13349][perf] Fuse gemma RMSNorm into AllReduce for Qwen3-Next/Qwen3.5…
#15194
opened Jun 10, 2026 by
nv-guomingz
Collaborator
Loading…
1 task done
[None][test] Add more error signature for stress test
#15193
opened Jun 10, 2026 by
fredricz-20070104
Collaborator
Loading…
[#15191][doc] Fix broken AutoDeploy README link
#15192
opened Jun 10, 2026 by
peter941221
Loading…
1 task done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.