Pull requests: flashinfer-ai/flashinfer

Pull requests list

minor: some fix and cleanup for trtllm-gen mha
#1302 opened Jul 22, 2025 by yyihuang
5 tasks done
3rparty: upgrade cutlass dependency to v4.1.0
#1299 opened Jul 22, 2025 by yzh119
5 tasks
Add weight layout
#1297 opened Jul 21, 2025 by aleozlx Draft
4 of 5 tasks
add mm_fp4 use cutlass backend for large bs
#1296 opened Jul 21, 2025 by ttyio
5 tasks done
Update cutlass fp4 moe kernels
#1294 opened Jul 21, 2025 by wenscarl
5 tasks
Add native cudnn_decode for improved cudnn decode performance
#1283 opened Jul 18, 2025 by Anerudhan
5 tasks done
ci: add github actions to upload sdist to pypi
#1270 opened Jul 16, 2025 by yzh119
5 tasks
Bug fix: fix duplicate launch in POD
#1267 opened Jul 16, 2025 by Edenzzzz
5 tasks
feat(aot): add nvshmem module for aot compilation
#1261 opened Jul 15, 2025 by EmilienM
3 of 5 tasks
refactor: separate SM100 and legacy TRT-LLM comm modules
#1259 opened Jul 15, 2025 by EmilienM
3 of 5 tasks
Mnnvl memory with custom communicator
#1245 opened Jul 14, 2025 by wenscarl Draft
5 tasks
feat: Restore convenience FLASHINFER_ENABLE_AOT option
#1235 opened Jul 8, 2025 by mgorny
3 of 5 tasks
[Feature] Support batch prefill for POD Attention
#1231 opened Jul 8, 2025 by Edenzzzz
7 tasks
Add ruff to pre-commit
#1201 opened Jul 1, 2025 by cyx-6 Draft
5 tasks
Add mypy to pre-commit
#1179 opened Jun 26, 2025 by cyx-6 Draft
5 tasks
Use flashinfer softmax in top_k_top_p_sampling_from_logits
#1171 opened Jun 24, 2025 by lgeiger
5 tasks done
[wip] Multimem allreduce cutlass dsl
#1169 opened Jun 23, 2025 by Amir-19 Draft
5 tasks