
Conversation

@crcrpar (Collaborator) commented Nov 6, 2025

What does this PR do?

As per the title, this PR adds F.scaled_mm to thunder.torch and covers it with a torchex implementation.

Ref: https://docs.pytorch.org/docs/main/generated/torch.nn.functional.scaled_mm.html
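For orientation, here is a minimal sketch of how the new symbol could be exercised end to end. The parameter names (scale_a, scale_recipe_a, swizzle_a, output_dtype, ...) are taken from the snippets in this PR; the linked docs are the authoritative reference for the signature, and the recipe values are placeholders:

```python
import torch
import torch.nn.functional as F
import thunder

# Sketch only: parameter names and their order follow the snippets in this
# PR; the recipe arguments come from PyTorch's scaling-recipe enum, for
# which the linked docs are the authoritative reference.
def fn(a, b, scale_a, recipe_a, scale_b, recipe_b):
    return F.scaled_mm(a, b, scale_a, recipe_a, scale_b, recipe_b,
                       output_dtype=torch.bfloat16)

# With this PR, thunder.jit can trace fn and the torchex executor
# delegates the call back to torch.nn.functional.scaled_mm.
jfn = thunder.jit(fn)
```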

Copilot AI (Contributor) left a comment:

Pull Request Overview

This PR adds support for the torch.nn.functional.scaled_mm operation in Thunder. This operation performs scaled matrix multiplication, which is commonly used for FP8 quantized operations.

  • Adds a new scaled_mm function to thunder/torch/__init__.py with input validation and shape inference logic
  • Registers the implementation in thunder/executors/torchex.py to delegate to PyTorch
  • Adds comprehensive test coverage with tensor-wise, row-wise, and block-wise scaling tests

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

  • thunder/torch/__init__.py: Implements the scaled_mm symbol with parameter validation and output shape/dtype inference
  • thunder/executors/torchex.py: Registers the torch executor implementation for scaled_mm
  • thunder/tests/test_ops.py: Adds test helper functions and comprehensive tests for scaled_mm with different scaling strategies


```python
    ValueError,
)
for enum_value in values:
    _ = int(enum_value.value) if hasattr(enum_value, "value") else int(enum_value)
```
Collaborator:

Should we raise a more detailed error here instead of a TypeError?
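For illustration, a more descriptive check could look like the sketch below; `utils.check` follows Thunder's check(pred, msg_fn, exception_type) convention, and the message wording here is hypothetical:

```python
# Sketch only: replace the bare int(...) cast (whose failure surfaces as a
# generic TypeError) with an explicit, descriptive validation.
for enum_value in values:
    raw = enum_value.value if hasattr(enum_value, "value") else enum_value
    utils.check(
        isinstance(raw, int),
        lambda: f"Expected an integer-valued scaling recipe, got {type(raw).__name__}",
        exception_type=ValueError,
    )
```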

```python
    scale_recipe_a,
    scale_b,
    scale_recipe_b,
    swizzle_a=None,
```
Collaborator:

It would be helpful to add type annotations for scale_a, scale_recipe_a, scale_b, scale_recipe_b, swizzle_a, and swizzle_b.
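For example, annotations along these lines might fit (a sketch: TensorLike is Thunder's tensor-proxy alias, and the int-based recipe/swizzle annotations are placeholders for whatever enum types torch.nn.functional.scaled_mm actually accepts):

```python
from collections.abc import Sequence

# Sketch only: the recipe/swizzle annotations below are placeholders and
# should mirror the enum types the PyTorch API accepts.
def scaled_mm(
    a: TensorLike,
    b: TensorLike,
    scale_a: TensorLike | Sequence[TensorLike],
    scale_recipe_a: int | Sequence[int],
    scale_b: TensorLike | Sequence[TensorLike],
    scale_recipe_b: int | Sequence[int],
    swizzle_a: int | None = None,
    swizzle_b: int | None = None,
    bias: TensorLike | None = None,
    output_dtype: torch.dtype | None = None,
) -> TensorLike:
    ...
```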

```python
    tensor_args.append(bias)
utils.check_same_device(*tensor_args)

result_dtype = to_dtype(output_dtype or torch.bfloat16)
```
Collaborator:

Is the `or torch.bfloat16` needed for the case where the user explicitly passes output_dtype=None?

@crcrpar (Collaborator, author):

I think it's still needed.
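Concretely, `output_dtype or torch.bfloat16` treats an omitted argument and an explicit output_dtype=None identically, since both are falsy:

```python
# Both an omitted output_dtype (defaulting to None) and an explicit None
# fall through to bfloat16; any real dtype passes through unchanged.
to_dtype(None or torch.bfloat16)           # resolves to bfloat16
to_dtype(torch.float32 or torch.bfloat16)  # resolves to float32
```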

```python
if not hasattr(torch.nn.functional, "scaled_mm"):
    pytest.skip("torch.nn.functional.scaled_mm is not available in this PyTorch build")
device = torch.device("cuda")
torch.manual_seed(0)
```
Collaborator:

It would be better not to set the seed so that we test with different values on each run.
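For instance, the fixed seed could be dropped in favor of fresh random inputs per run; torch.testing.make_tensor is one option (shapes and dtype here are illustrative):

```python
import torch
from torch.testing import make_tensor

# Illustrative shapes/dtype only: generate fresh random inputs on every run
# rather than pinning the global RNG with torch.manual_seed(0).
a = make_tensor((16, 32), dtype=torch.float32, device="cuda")
b = make_tensor((32, 8), dtype=torch.float32, device="cuda")
```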

@Lightning-AI deleted a comment from kshitij12345 Nov 7, 2025
Signed-off-by: Masaki Kozuki <[email protected]>
@kshitij12345 (Collaborator) left a comment:

Overall looks good, just a few comments regarding the tests. Thanks!

Signed-off-by: Masaki Kozuki <[email protected]>
@kshitij12345 (Collaborator) left a comment:

LGTM, thanks @crcrpar
