Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][feat] wide_ep support block-wise FP8 on blackwell
#7376 opened Aug 29, 2025 by xxi-nv Loading…
1 task done
[None] [fix] store blog 10 media via lfs
#7375 opened Aug 29, 2025 by Funatiq Loading…
1 task done
[None][chore] rm executor config in kv cache connector
#7372 opened Aug 29, 2025 by leslie-fang25 Loading…
1 task done
[TRTLLM-6642][feat] add gptoss 20g tests
#7361 opened Aug 29, 2025 by xinhe-nv Draft
1 task
[None] [doc] Update DeepSeek example doc
#7358 opened Aug 29, 2025 by jiahanc Loading…
1 task
[None][fix] Revert TP Sharding read from the model config (#6972) bug Something isn't working
#7356 opened Aug 29, 2025 by lucaslie Loading…
1 task done
Perf/gpt oss eagle
#7353 opened Aug 28, 2025 by ameynaik-hub Loading…
[None][chore] Fix formatting error in Gemma3 readme
#7352 opened Aug 28, 2025 by karljang Loading…
1 task done
[TRTLLM-5059][feat] Add kv cache reuse for LlavaNext
#7349 opened Aug 28, 2025 by chang-l Loading…
1 task
[None][fix] Fix KV cache recompute in draft_target spec decode
#7348 opened Aug 28, 2025 by mikeiovine Loading…
1 task done
[None][doc] Update doc for multimodal
#7347 opened Aug 28, 2025 by chang-l Loading…
1 task
[None][chore] Add env var to disable harmony
#7343 opened Aug 28, 2025 by LinPoly Loading…
1 task done
[TRTLLM-7250][fix] Add failed cases into waives.txt
#7342 opened Aug 28, 2025 by xinhe-nv Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.