File tree
805 files changed
+70383
-9461
lines changed- .github
- ISSUE_TEMPLATE
- workflows
- applications
- dual_gemm/collective
- flash_attention_v2/collective
- examples
- 01_cutlass_utilities
- 05_batched_gemm
- 07_volta_tensorop_gemm
- 08_turing_tensorop_gemm
- 09_turing_tensorop_conv2dfprop
- 13_two_tensor_op_fusion
- device
- kernel
- threadblock
- 16_ampere_tensorop_conv2dfprop
- 18_ampere_fp64_tensorop_affine2_gemm
- 19_tensorop_canonical
- 20_simt_canonical
- 35_gemm_softmax
- 36_gather_scatter_fusion
- 37_gemm_layernorm_gemm_fusion
- 39_gemm_permute
- 40_cutlass_py/customizable
- 41_fused_multi_head_attention
- epilogue
- gemm
- iterators
- 43_ell_block_sparse_gemm
- 44_multi_gemm_ir_and_codegen
- fixed_impl/epilogue/threadblock
- ir_gen
- 45_dual_gemm/threadblock
- 49_hopper_gemm_with_collective_builder
- 50_hopper_gemm_with_epilogue_swizzle
- 52_hopper_gather_scatter_fusion
- 53_hopper_gemm_permute
- 56_hopper_ptr_array_batched_gemm
- 57_hopper_grouped_gemm
- 58_ada_fp8_gemm
- 59_ampere_gather_scatter_conv
- 63_hopper_gemm_with_weight_prefetch/collective
- 64_ada_fp8_gemm_grouped
- 65_distributed_gemm
- 67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling
- 68_hopper_fp8_warp_specialized_grouped_gemm_with_blockwise_scaling
- 70_blackwell_gemm
- 71_blackwell_gemm_with_collective_builder
- 72_blackwell_narrow_precision_gemm
- 73_blackwell_gemm_preferred_cluster
- 74_blackwell_gemm_streamk
- 75_blackwell_grouped_gemm
- 76_blackwell_conv
- 77_blackwell_fmha
- collective
- common
- device
- kernel
- reference
- 78_blackwell_emulated_bf16x9_gemm
- 79_blackwell_geforce_gemm
- 80_blackwell_geforce_sparse_gemm
- 81_blackwell_gemm_blockwise
- 82_blackwell_distributed_gemm
- 84_blackwell_narrow_precision_sparse_gemm
- 86_blackwell_mixed_dtype_gemm
- 87_blackwell_geforce_gemm_blockwise
- 88_hopper_fmha
- collective
- device
- kernel
- reference
- 89_sm103_fp4_ultra_gemm
- 90_sm103_fp4_ultra_grouped_gemm
- 91_fp4_gemv
- common
- cute/tutorial
- blackwell
- hopper
- python/deprecated
- include
- cute
- algorithm
- arch
- atom
- container
- numeric
- util
- cutlass
- arch
- conv
- collective
- device
- kernel
- detail
- collective
- epilogue
- collective
- builders
- fusion
- threadblock
- fusion
- thread
- experimental/distributed/device
- gemm
- collective
- builders
- device
- kernel
- threadblock
- warp
- layout
- pipeline
- platform
- transform
- collective
- threadblock
- warp
- media/docs/cpp
- build
- cute
- python
- cutlass_library
- cutlass
- backend
- evt
- backend
- frontend
- ir
- passes
- utils
- emit
- epilogue
- op
- utils
- pycute
- test
- python/cutlass
- conv2d
- emit
- evt
- utils
- gemm
- interface
- self_contained_includes
- unit
- common
- conv/device_3x
- dgrad
- fprop
- wgrad
- core
- cute
- ampere
- core
- turing
- gemm/device
- sm100_blockscaled_sparse_tensorop_gemm
- sm100_sparse_tensorop_gemm
- narrow_precision
- sm100_tensorop_gemm
- layout
- nvrtc/thread
- tools
- library
- include/cutlass/library
- src
- reference
- profiler
- include/cutlass/profiler
- src
- util/include/cutlass/util
- reference/host
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
805 files changed
+70383
-9461
lines changedThis file was deleted.
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + |
This file was deleted.
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + |
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + |
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + |
0 commit comments