-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRTLLM-12250][feat] added lm head sharding
#12252
opened Mar 16, 2026 by
greg-kwasniewski1
Loading…
1 task done
fix(kv_cache): eliminate dangling reference and TOCTOU races in KVCacheManager
#12251
opened Mar 16, 2026 by
thorjohnsen
Loading…
3 tasks
[https://nvbugs/5969206][fix] BREAKING: Setting default value of KV cache transfer timeout to 60s
#12249
opened Mar 16, 2026 by
pcastonguay
Loading…
1 task done
[None][fix] Fix the issue of excluding all context attention kernels when building for sm103
#12248
opened Mar 16, 2026 by
yifeizhang-c
Loading…
1 task done
[https://nvbugs/5895249][fix] Update test waives
#12247
opened Mar 16, 2026 by
greg-kwasniewski1
Loading…
1 task done
[None][feat] Refactor the routing part in trtllmgen
#12246
opened Mar 16, 2026 by
ChristinaZ
Loading…
1 task
[None][fix] Relax MoE test tolerance for fp16 TP mode accuracy mismatch
#12244
opened Mar 16, 2026 by
xxi-nv
Loading…
1 task done
[#11526][chore] AutoDeploy accuracy tests: use nemotron-3 official checkpoints
#12243
opened Mar 16, 2026 by
galagam
Loading…
1 task done
[#12227][fix] Add timeout to MultiProcessExecutor shutdown to prevent test hangs
Community want to contribute
PRs initiated from Community
#12241
opened Mar 16, 2026 by
edenfunf
Loading…
4 tasks done
[#12183][fix] Fix TRTLLM-Gen NVFP4 MoE scales for mixed-precision che…
#12240
opened Mar 16, 2026 by
tcherckez-nvidia
Loading…
1 task done
[TRTLLM-8922][feat] gen-first disagg scheduling, part 2
#12239
opened Mar 16, 2026 by
reasonsolo
•
Draft
1 task
[https://nvbugs/5973214][fix] unwaive qwen3 ci test
#12237
opened Mar 16, 2026 by
byshiue
Loading…
1 task
[TRTLLM-10407][perf] Enable CuteDSL indexer_top_k in model
#12236
opened Mar 16, 2026 by
limin2021
Loading…
1 task done
[https://nvbugs/5973536][fix] Route DSA attention through MLA custom …
#12235
opened Mar 16, 2026 by
v-shobhit
Loading…
1 task done
[None][feat] Align AttentionPlugin with EdgeLLM interface
#12233
opened Mar 16, 2026 by
nvyocox
Loading…
9 tasks done
[TRTLLM-9523][fix] Fix and refactor the transfer logic (step 6)
#12231
opened Mar 16, 2026 by
Shixiaowei02
Loading…
1 task done
feat(quantization): add LoRA support for FP4Linear and FP4RowLinear
Community want to contribute
PRs initiated from Community
#12229
opened Mar 15, 2026 by
langzhao-netizen
Loading…
4 tasks done
[None][feat] Add GLM-4.7-Flash and Qwen3.5 NVFP4 models to BTK benchmark registry
AutoDeploy
<NV> AutoDeploy Backend
Community want to contribute
PRs initiated from Community
#12221
opened Mar 15, 2026 by
edenfunf
Loading…
2 tasks done
[None][docs] Update nemotron 3 super deployment to include tool calli…
#12215
opened Mar 14, 2026 by
tijyojwad
Loading…
1 task done
[None][feat] Add /v1/models endpoint to OpenAIDisaggServer
Community want to contribute
PRs initiated from Community
#12213
opened Mar 14, 2026 by
edenfunf
Loading…
3 tasks done
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.