-
Notifications
You must be signed in to change notification settings - Fork 390
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ensure graph is captured and executed on the same stream to avoid rep…
#1303
opened Jul 22, 2025 by
elfiegg
Loading…
minor: some fix and cleanup for trtllm-gen mha
#1302
opened Jul 22, 2025 by
yyihuang
Loading…
5 tasks done
add mm_fp4 use cutlass backend for large bs
#1296
opened Jul 21, 2025 by
ttyio
Loading…
5 tasks done
[fix] fix integer overflow in FA2 customized_mask & add buffer overflow warning.
#1290
opened Jul 19, 2025 by
happierpig
Loading…
5 tasks done
Add native cudnn_decode for improved cudnn decode performance
#1283
opened Jul 18, 2025 by
Anerudhan
Loading…
5 tasks done
use NVSHMEM4Py instead of custom bindings for NVSHMEM MNNVL Allreduce
#1263
opened Jul 15, 2025 by
Amir-19
Loading…
5 tasks
feat(aot): add nvshmem module for aot compilation
#1261
opened Jul 15, 2025 by
EmilienM
Loading…
3 of 5 tasks
refactor: separate SM100 and legacy TRT-LLM comm modules
#1259
opened Jul 15, 2025 by
EmilienM
Loading…
3 of 5 tasks
bugfix: fix fp32 acc threshold for qk using math::inf according to dtype by AIDC-AI
#1247
opened Jul 14, 2025 by
yongchaoding
Loading…
5 tasks done
Add the keyword "template" to member template specialization appears after
.
or ->
in a post-fix expression which is a requirement in C++ standard
#1246
opened Jul 14, 2025 by
tomflinda
Loading…
feat: Restore convenience
FLASHINFER_ENABLE_AOT
option
#1235
opened Jul 8, 2025 by
mgorny
Loading…
3 of 5 tasks
[Feature] Support batch prefill for POD Attention
#1231
opened Jul 8, 2025 by
Edenzzzz
Loading…
7 tasks
Use flashinfer
softmax
in top_k_top_p_sampling_from_logits
#1171
opened Jun 24, 2025 by
lgeiger
Loading…
5 tasks done
Previous Next
ProTip!
Follow long discussions with comments:>50.