Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][doc] Add MoE developer guide for fused_moe module
#12534 opened Mar 25, 2026 by xxi-nv Loading…
2 tasks done
[https://nvbugs/6007967][fix] fix disagg pp hang issue
#12528 opened Mar 25, 2026 by bo-nv Loading…
1 task
[https://nvbugs/6007197][fix] Adjust RocketKV test threshold
#12527 opened Mar 25, 2026 by heyuhhh Loading…
1 task done
[None][doc] Fix duplicate words in comments
#12524 opened Mar 25, 2026 by YihuiLu512 Loading…
1 task done
Adds a LMCache v1 KV connector example (llm_lmcache_connector.py) that Community want to contribute PRs initiated from Community
#12522 opened Mar 25, 2026 by feixiangpeng Loading…
1 task
[None][test] Add different input-output of eagle cases on Spark
#12520 opened Mar 25, 2026 by JennyLiu-nv Loading…
1 task done
[#11992][fix] Support include_stop_token_in_output in gRPC request manager Community want to contribute PRs initiated from Community
#12517 opened Mar 24, 2026 by CatherineSue Loading…
3 tasks done
ProTip! no:milestone will show everything without a milestone.