Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[https://nvbugs/5846154][fix] Fix CuteDSL argmax on SM120
#11185 opened Feb 2, 2026 by syuoni Loading…
1 task done
[None][test] Add DGX-Spark multinode perf cases
#11184 opened Feb 2, 2026 by JennyLiu-nv Loading…
1 task done
[https://nvbugs/5854860][fix] Fix cutedsl argmax on sm120
#11181 opened Feb 2, 2026 by dongfengy Loading…
1 task done
[None][fix] Fix chat request bug for modality model Community want to contribute PRs initiated from Community Multimodal Label for issues & PRs regarding Multimodal related objects
#11179 opened Feb 2, 2026 by Lihui-Gu Loading…
1 task
[None][chore] Enable Nemotron Super nvfp4 tests
#11172 opened Feb 1, 2026 by tcherckez-nvidia Loading…
1 task done
[None][chore] Remove closed bugs
#11171 opened Feb 1, 2026 by xinhe-nv Loading…
[None][fix] Remove duplicated MoE Computation with Helix CP+DP
#11167 opened Feb 1, 2026 by brb-nv Loading…
1 task done
[https://nvbugs/5574553][fix] Unwaive tests
#11162 opened Jan 31, 2026 by hyukn Loading…
1 task done
[feat][WIP] improve sharding time
#11161 opened Jan 31, 2026 by taylor-yb-lee Draft
1 task
[None][fix] Add an env var to turn off tinygemm
#11157 opened Jan 31, 2026 by dongfengy Loading…
1 task done
[None][chore] Print memory usage before/after accuracy test in CI AutoDeploy <NV> AutoDeploy Backend
#11155 opened Jan 30, 2026 by taylor-yb-lee Loading…
1 task done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.