[mxfp] handle values close to max correctly w/o overflow #8356

jongsoo-openai · 2025-10-02T22:32:29Z

Need to clamp since due to rounding, we can have overflow that was within
the range before quantization.
e.g., 3.3895e+38 -> log2(3.3895e+38 / max_fp8e4m3=448) ~= 119.17 -> round
up to 120 + exp_bias=127 -> scale=247
3.3895e+38 / 2120 ~= 254.9976 -> round to 256 in fp8e4m3fn
Dequantization: 256 * 2120 > 3.4e38 overflowing 3.38953139e38

New contributor declaration

I am not making a trivial change, such as fixing a typo in a comment.
I have written a PR description following these
rules.
I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- I have added tests.
  - /test for lit tests
  - /unittest for C++ tests
  - /python/test for end-to-end tests
- This PR does not need a test because FILL THIS IN.
Select one of the following.
- I have not added any lit tests.
- The lit tests I have added follow these best practices,
  including the "tests should be minimal" section. (Usually running Python code
  and using the instructions it generates is not minimal.)

jongsoo-openai · 2025-10-03T22:03:43Z

/merge

…#8356) Need to clamp since due to rounding, we can have overflow that was within the range before quantization. e.g., 3.3895e+38 -> log2(3.3895e+38 / max_fp8e4m3=448) ~= 119.17 -> round up to 120 + exp_bias=127 -> scale=247 3.3895e+38 / 2**120 ~= 254.9976 -> round to 256 in fp8e4m3fn Dequantization: 256 * 2**120 > 3.4e38 overflowing 3.38953139e38  # New contributor declaration - [x] I am not making a trivial change, such as fixing a typo in a comment. - [x] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how). - [x] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`. - Select one of the following. - [x] I have added tests. - `/test` for `lit` tests - `/unittest` for C++ tests - `/python/test` for end-to-end tests - [ ] This PR does not need a test because `FILL THIS IN`. - Select one of the following. - [x] I have not added any `lit` tests. - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)

Need to clamp since due to rounding, we can have overflow that was within the range before quantization. e.g., 3.3895e+38 -> log2(3.3895e+38 / max_fp8e4m3=448) ~= 119.17 -> round up to 120 + exp_bias=127 -> scale=247 3.3895e+38 / 2**120 ~= 254.9976 -> round to 256 in fp8e4m3fn Dequantization: 256 * 2**120 > 3.4e38 overflowing 3.38953139e38  # New contributor declaration - [x] I am not making a trivial change, such as fixing a typo in a comment. - [x] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how). - [x] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`. - Select one of the following. - [x] I have added tests. - `/test` for `lit` tests - `/unittest` for C++ tests - `/python/test` for end-to-end tests - [ ] This PR does not need a test because `FILL THIS IN`. - Select one of the following. - [x] I have not added any `lit` tests. - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)

…#8356) Need to clamp since due to rounding, we can have overflow that was within the range before quantization. e.g., 3.3895e+38 -> log2(3.3895e+38 / max_fp8e4m3=448) ~= 119.17 -> round up to 120 + exp_bias=127 -> scale=247 3.3895e+38 / 2**120 ~= 254.9976 -> round to 256 in fp8e4m3fn Dequantization: 256 * 2**120 > 3.4e38 overflowing 3.38953139e38  # New contributor declaration - [x] I am not making a trivial change, such as fixing a typo in a comment. - [x] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how). - [x] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`. - Select one of the following. - [x] I have added tests. - `/test` for `lit` tests - `/unittest` for C++ tests - `/python/test` for end-to-end tests - [ ] This PR does not need a test because `FILL THIS IN`. - Select one of the following. - [x] I have not added any `lit` tests. - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)

[mxfp] handle values close to max correctly w/o overflow

54d9bc5

jongsoo-openai force-pushed the 1002_mx_deq_overflow branch from 0fdcca1 to 54d9bc5 Compare October 2, 2025 23:31

jongsoo-openai marked this pull request as ready for review October 3, 2025 01:10

jongsoo-openai requested a review from ptillet as a code owner October 3, 2025 01:10

jongsoo-openai enabled auto-merge (squash) October 3, 2025 21:27

lezcano approved these changes Oct 4, 2025

View reviewed changes

jongsoo-openai merged commit 3910f27 into triton-lang:main Oct 4, 2025
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mxfp] handle values close to max correctly w/o overflow #8356

[mxfp] handle values close to max correctly w/o overflow #8356

Uh oh!

jongsoo-openai commented Oct 2, 2025 •

edited

Loading

Uh oh!

jongsoo-openai commented Oct 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[mxfp] handle values close to max correctly w/o overflow #8356

[mxfp] handle values close to max correctly w/o overflow #8356

Uh oh!

Conversation

jongsoo-openai commented Oct 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

New contributor declaration

Uh oh!

jongsoo-openai commented Oct 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jongsoo-openai commented Oct 2, 2025 •

edited

Loading