[Backend Tester] Add quantized test flows for XNNPACK and Core ML #12733


Merged
merged 30 commits into from
Jul 23, 2025

Conversation

GregoryComer
Member

@GregoryComer GregoryComer commented Jul 22, 2025

Add quantized test flows covering static int8 quantization for the XNNPACK and Core ML backends. I also ended up doing some light refactoring of the test signature to pass the TestFlow class into the individual tests, which allows quantization parameters to be passed into the inner test.
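The refactor described here can be sketched as below. All names (`TestFlow`, `run_test`, the fields) are assumptions for illustration, not the PR's actual definitions: the point is that the flow object travels into the test body, so flow-specific settings such as quantization parameters are available there.

```python
# Hypothetical sketch of passing a TestFlow into each test; names are
# illustrative, not the actual ExecuTorch test-suite API.
from dataclasses import dataclass
from typing import Any


@dataclass
class TestFlow:
    name: str
    backend: str
    quantize: bool = False  # whether the tester runs the quantize stage


def run_test(model: Any, flow: TestFlow) -> str:
    # Because the flow is passed in, the inner test can branch on
    # flow.quantize or read quantizer settings from the flow itself.
    stage = "quantized" if flow.quantize else "fp32"
    return f"{flow.backend}:{stage}"


print(run_test(object(), TestFlow("xnnpack_static_int8", "xnnpack", quantize=True)))
```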

[ghstack-poisoned]

pytorch-bot bot commented Jul 22, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12733

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 77 Pending

As of commit 27cd171 with merge base 29c52b5 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 22, 2025

import coremltools as ct
Contributor


come on :p

Suggested change
import coremltools as ct
import coremltools

def _get_static_int8_qconfig():
return ct.optimize.torch.quantization.LinearQuantizerConfig(
global_config=ct.optimize.torch.quantization.ModuleLinearQuantizerConfig(
quantization_scheme="symmetric",
Contributor

@digantdesai digantdesai Jul 23, 2025


Is this the main int8 schema we should be testing for Linear? @metascroy

Member Author


FYI This is pulled directly from our docs at https://docs.pytorch.org/executorch/main/backends-coreml.html#bit-quantization-using-the-pt2e-flow. Would be good to sanity check with Scott, though.
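For reference, the full static int8 config from the linked Core ML docs looks roughly like the sketch below; the diff excerpt above is truncated by the comment anchor, and the dtype and per-channel arguments here are assumptions based on those docs rather than quotes from this diff.

```python
# Sketch of the full static int8 quantizer config, following the ExecuTorch
# Core ML docs linked above. activation_dtype, weight_dtype, and
# weight_per_channel are assumptions taken from those docs.
import coremltools as ct
import torch


def _get_static_int8_qconfig():
    return ct.optimize.torch.quantization.LinearQuantizerConfig(
        global_config=ct.optimize.torch.quantization.ModuleLinearQuantizerConfig(
            quantization_scheme="symmetric",
            activation_dtype=torch.quint8,
            weight_dtype=torch.qint8,
            weight_per_channel=True,
        )
    )
```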

logger.info("Skipping XNNPACK flow registration due to import failure.")
return None

from executorch.backends.test.suite.flows.xnnpack import XNNPACK_TEST_FLOW, XNNPACK_STATIC_INT8_TEST_FLOW
Contributor


from ... import *?

Member Author


I'm going to clean up flow registration slightly in the stack, so I'll take this as a follow-up there.

Comment on lines 29 to 33
quantize: bool = field(default=False)
""" Whether to tester should run the quantize stage on the model. """

quantize_stage_factory: Callable[..., Quantize] | None = None
""" A factory function which instantiates a Quantize stage. Can be None to use the tester's default. """
Contributor


why an extra flag?

Member Author


The specific reason is that if quantize_stage_factory isn't provided, it will use the default Quantize stage from the tester. I could maybe just always require the caller to provide quantize_stage_factory.
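The fallback the author describes can be sketched as follows; `Flow`, `Quantize`, and `make_quantize_stage` are hypothetical stand-ins, not the tester's real classes:

```python
# Hypothetical sketch of the two-field design: the quantize flag gates the
# stage, and quantize_stage_factory overrides the tester's default Quantize.
from dataclasses import dataclass
from typing import Callable, Optional


class Quantize:
    """Stand-in for the tester's Quantize stage."""

    def __init__(self, name: str = "default"):
        self.name = name


@dataclass
class Flow:
    quantize: bool = False
    quantize_stage_factory: Optional[Callable[[], Quantize]] = None


def make_quantize_stage(flow: Flow) -> Optional[Quantize]:
    if not flow.quantize:
        return None  # quantize stage skipped entirely
    if flow.quantize_stage_factory is not None:
        return flow.quantize_stage_factory()  # flow-specific quantizer
    return Quantize()  # fall back to the tester's default stage


print(make_quantize_stage(Flow(quantize=True)).name)
```

This shows why the extra flag is useful: a flow can opt into quantization without having to supply its own factory.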

@@ -14,23 +14,26 @@ class TestResult(IntEnum):

EAGER_FAIL = 2
""" The test failed due to the model failing to run in eager mode. """

QUANTIZE_FAIL = 3
Contributor


Do we want to distinguish export_for_training (old) before quantize from export after quantize?

Member Author


If I'm following correctly, are you asking about catching errors in the quant export vs the quantization itself?

It would be nice to have this, though we'd need to refactor the tester a bit to split it.

Contributor


yeah finer grained. We can do this later, hence the stamp :)
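The finer-grained split discussed here could look like the sketch below; `QUANTIZE_EXPORT_FAIL` is a hypothetical name illustrating the idea, not a value added by this PR:

```python
# Result codes from the diff above, plus a hypothetical finer-grained code
# distinguishing failure in quantization itself from failure in the export
# step that follows it. QUANTIZE_EXPORT_FAIL and its value are assumptions.
from enum import IntEnum


class TestResult(IntEnum):
    EAGER_FAIL = 2            # model failed to run in eager mode
    QUANTIZE_FAIL = 3         # failure during quantization (as in this PR)
    QUANTIZE_EXPORT_FAIL = 4  # hypothetical: failure exporting the quantized model


print(TestResult.QUANTIZE_FAIL.name, TestResult.QUANTIZE_FAIL.value)
```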

Contributor

@digantdesai digantdesai left a comment


You could split the quantizer and the flow that uses it into two PRs, but this is OK too.

@GregoryComer
Member Author

GregoryComer commented Jul 23, 2025

Closes #12494

@GregoryComer GregoryComer changed the base branch from gh/GregoryComer/80/head to main July 23, 2025 21:21
GregoryComer added a commit that referenced this pull request Jul 23, 2025
ghstack-source-id: 3bd0358
ghstack-comment-id: 3105090683
Pull-Request: #12733
@GregoryComer
Member Author

Updated to address comments and lints. There are a few minor follow-ups (see the specific comment chains), but I'm intending to land and take the follow-ups afterward to keep things moving.

@GregoryComer
Member Author

Unit test failures are pre-existing on main (pytest coverage error). CI was green before rebasing. Merging.

@GregoryComer GregoryComer merged commit 1af1d11 into main Jul 23, 2025
100 of 104 checks passed
@GregoryComer GregoryComer deleted the gh/GregoryComer/83/head branch July 23, 2025 23:28
Conarnar pushed a commit to Conarnar/executorch that referenced this pull request Jul 25, 2025
…torch#12733)

Add quantized test flows covering static int8 quantization for the XNNPACK and
Core ML backends. I also ended up doing some light refactoring of the test
signature to pass the TestFlow class into the individual tests, which allows
quantization parameters to be passed into the inner test.
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. release notes: none Do not include this in the release notes