Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][chore] TEST ONLY: Pre-merge CI validation
#15929 opened Jul 3, 2026 by jiaganc Collaborator Draft
1 task done
[None][fix] Enable MiniMax M3 piecewise CUDA graphs
#15923 opened Jul 3, 2026 by liji-nv Collaborator Loading…
1 task done
[None][perf] Move greedy stop checks to host
#15920 opened Jul 3, 2026 by mingyangHao Collaborator Loading…
1 task
[None][feat] Add DeepSeek-V4-Pro curated configs
#15919 opened Jul 3, 2026 by lfr-0531 Collaborator Draft
1 task done
[TRTLLM-14022][feat] Remove legacy TensorRT Python backend
#15918 opened Jul 3, 2026 by Wanli-Jiang Collaborator Draft
1 task done
[https://nvbugs/6405665][test] Disable block reuse for KV cache comparison
#15917 opened Jul 3, 2026 by jiaganc Collaborator Loading…
1 task done
[None][feat] integrate commit-min snapshots with V2 reuse policies
#15916 opened Jul 3, 2026 by jiaganc Collaborator Draft
1 task done
[None][test] Dump sibling worker logs from disagg BENCHMARK pytest
#15912 opened Jul 3, 2026 by chenfeiz0326 Collaborator Loading…
4 tasks done
[None][test] Add opt-in background prefetch of test MPI sessions and model page cache
#15908 opened Jul 3, 2026 by sunnyqgg Collaborator Loading…
3 tasks done
[TRTLLM-13784][chore] Remove legacy TensorRT-engine Triton backend
#15907 opened Jul 3, 2026 by Wanli-Jiang Collaborator Loading…
1 task done
[None][Perf] Update CuTeDSL MegaMoE kernels
#15906 opened Jul 3, 2026 by Barry-Delaney Collaborator Draft
[None][feat] Disagg coordinator + orchestrator fleet
#15905 opened Jul 3, 2026 by reasonsolo Collaborator Draft
1 task done
ProTip! Filter pull requests by the default branch with base:main.