-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[https://nvbugs/5445466][fix] Eliminate race when loading HF dynamic modules (#7268)
#7379
opened Aug 29, 2025 by
chang-l
Loading…
1 task
[None][Doc] Polish parallelism doc and add wide ep sectionwq.
1.0_doc
#7378
opened Aug 29, 2025 by
nv-guomingz
Loading…
1 task
[TRTLLM-7008][fix] Add automatic shared memory delete if already exist
#7377
opened Aug 29, 2025 by
dongxuy04
Loading…
1 task done
[None][feat] wide_ep support block-wise FP8 on blackwell
#7376
opened Aug 29, 2025 by
xxi-nv
Loading…
1 task done
[https://nvbugs/5485325][fix] Add a postprocess to the model engine to fix the CUDA graph warmup issue when using speculative decoding
#7373
opened Aug 29, 2025 by
lfr-0531
Loading…
1 task done
[None][chore] rm executor config in kv cache connector
#7372
opened Aug 29, 2025 by
leslie-fang25
Loading…
1 task done
[TRTLLM-7261][feat] Support phi-4 model in pytorch backend
#7371
opened Aug 29, 2025 by
Wanli-Jiang
•
Draft
1 task
[TRTLLM-6747][feat] Merge add sparse exp and shared exp into local reduction
#7369
opened Aug 29, 2025 by
zongfeijing
Loading…
1 task
[https://nvbugs/5485593][fix] improve accuracy/test_disaggregated_serving.py
#7366
opened Aug 29, 2025 by
reasonsolo
Loading…
1 task
[TRTLLM-7279][test] add accuracy test for deepseek-r1 with chunked_prefill
#7365
opened Aug 29, 2025 by
crazydemo
Loading…
1 task done
[TRTLLM-7330][feat]Eagle3 cuda graph support for the 1st draft model inference
Community want to contribute
PRs initiated from Community
#7363
opened Aug 29, 2025 by
sunnyqgg
Loading…
1 task
[TRTLLM-7410][feat] Enable video modality for hashing/kv_reuse and generalize finding mm_token_length
#7360
opened Aug 29, 2025 by
chang-l
Loading…
1 task
[None][fix] Revert TP Sharding read from the model config (#6972)
bug
Something isn't working
#7356
opened Aug 29, 2025 by
lucaslie
Loading…
1 task done
[None][chore] Fix formatting error in Gemma3 readme
#7352
opened Aug 28, 2025 by
karljang
Loading…
1 task done
[TRTLLM-5059][feat] Add kv cache reuse for LlavaNext
#7349
opened Aug 28, 2025 by
chang-l
Loading…
1 task
[None][fix] Fix KV cache recompute in draft_target spec decode
#7348
opened Aug 28, 2025 by
mikeiovine
Loading…
1 task done
[None][chore] Add env var to disable harmony
#7343
opened Aug 28, 2025 by
LinPoly
Loading…
1 task done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.