[BENCHMARK] Reuse CUTLASS's gemm configuration file #4720

jle-quel · 2025-07-15T13:50:43Z

Description

This PR introduces a new mechanism for fetching GEMM configurations.

Instead of hardcoding the (shape → config) mapping, the config-tool.py script now parses a configuration file and generates the gemm_config structure dynamically.

The configuration file consists of a list of GEMM kernel invocations with the corresponding GemmConfig. These will be extracted and used to invoke the kernel with the appropriate configuration.

Note

Currently, the configuration file is located in benchmarks/cutlass_kernel/gemm. In the future, this should be updated to fetch the file directly from the CUTLASS repository: https://github.com/intel/cutlass-sycl
This change will be made once the CUTLASS repo includes a unified file containing the optimal configurations for all shapes used in the Triton benchmark.

Instead of creating our own mapping from problem shape to CUTLASS GEMM configuration, re-use existing information in CUTLASS. This adds a small tool that can parse CUTLASS' benchmark configuration files and generate a C++ header with the problem shape to configuration mapping. The generated header is included in the CUTLASS kernel benchmark to dispatch to the best known configuration for each problem shape. Signed-off-by: Lukas Sommer <[email protected]>

…into sommerlukas/reuse-cutlass-gemm-config Signed-off-by: Jefferson Le Quellec <[email protected]>

Signed-off-by: Jefferson Le Quellec <[email protected]>

anmyachev · 2025-07-25T14:00:45Z

This change will be made once the CUTLASS repo includes a unified file containing the optimal configurations for all shapes used in the Triton benchmark.

Hi @jle-quel. Is there a tracker for this yet? So we can monitor it more easily.

benchmarks/cutlass_kernel/gemm/gemm.hpp

jle-quel · 2025-07-28T10:29:18Z

This change will be made once the CUTLASS repo includes a unified file containing the optimal configurations for all shapes used in the Triton benchmark.

Hi @jle-quel. Is there a tracker for this yet? So we can monitor it more easily.

There is a ticket on our side that is tracking it : #4775

whitneywhtsang

Please double check no unexpected performance impact on the CUTLASS GEMM before merging.

Started CI: https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/16576519983

whitneywhtsang · 2025-07-28T21:00:25Z

Potential performance degradation on CUTLASS gemm: Grafana

sommerlukas · 2025-07-29T08:03:13Z

Potential performance degradation on CUTLASS gemm: Grafana

As neither the previous nor this configuration are delivering the best known performance for GEMM in CUTLASS, I'd suggest to still merge this PR with the basic infrastructure for the config files and add/use the correct configuration in a future PR.

sommerlukas and others added 2 commits May 23, 2025 16:31

Merge branch 'main' of github.com:intel/intel-xpu-backend-for-triton …

1c81a12

…into sommerlukas/reuse-cutlass-gemm-config Signed-off-by: Jefferson Le Quellec <[email protected]>

jle-quel marked this pull request as draft July 15, 2025 13:51

jle-quel changed the title ~~[BENCHMARK] Reuse CUTLASS gemm config~~ [DRAFT][BENCHMARK] Reuse CUTLASS's gemm configuration file Jul 15, 2025

jle-quel added 2 commits July 23, 2025 18:41

Use temporary gemm config input

e16f99a

Signed-off-by: Jefferson Le Quellec <[email protected]>

Apply formatting

e6dd7e0

Signed-off-by: Jefferson Le Quellec <[email protected]>

jle-quel mentioned this pull request Jul 25, 2025

[BENCHMARK] Use GEMM configuration file from CUTLASS #4775

Open

jle-quel marked this pull request as ready for review July 25, 2025 09:52

jle-quel changed the title ~~[DRAFT][BENCHMARK] Reuse CUTLASS's gemm configuration file~~ [BENCHMARK] Reuse CUTLASS's gemm configuration file Jul 25, 2025

jle-quel requested review from anmyachev, a team and whitneywhtsang July 25, 2025 09:52

sommerlukas approved these changes Jul 25, 2025

View reviewed changes

anmyachev reviewed Jul 25, 2025

View reviewed changes

benchmarks/cutlass_kernel/gemm/gemm.hpp Show resolved Hide resolved

anmyachev approved these changes Jul 25, 2025

View reviewed changes

whitneywhtsang approved these changes Jul 28, 2025

View reviewed changes

etiotto merged commit 82f505a into main Jul 29, 2025
18 checks passed

etiotto deleted the sommerlukas/reuse-cutlass-gemm-config branch July 29, 2025 19:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BENCHMARK] Reuse CUTLASS's gemm configuration file #4720

[BENCHMARK] Reuse CUTLASS's gemm configuration file #4720

Uh oh!

jle-quel commented Jul 15, 2025 •

edited

Loading

Uh oh!

anmyachev commented Jul 25, 2025

Uh oh!

Uh oh!

jle-quel commented Jul 28, 2025

Uh oh!

whitneywhtsang left a comment •

edited

Loading

Uh oh!

whitneywhtsang commented Jul 28, 2025

Uh oh!

sommerlukas commented Jul 29, 2025

Uh oh!

Uh oh!

Uh oh!

[BENCHMARK] Reuse CUTLASS's gemm configuration file #4720

[BENCHMARK] Reuse CUTLASS's gemm configuration file #4720

Uh oh!

Conversation

jle-quel commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Note

Uh oh!

anmyachev commented Jul 25, 2025

Uh oh!

Uh oh!

jle-quel commented Jul 28, 2025

Uh oh!

whitneywhtsang left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

whitneywhtsang commented Jul 28, 2025

Uh oh!

sommerlukas commented Jul 29, 2025

Uh oh!

Uh oh!

Uh oh!

jle-quel commented Jul 15, 2025 •

edited

Loading

whitneywhtsang left a comment •

edited

Loading