[ET-VK][q8ta] Add q8ta_linear operator for int8 quantized linear #17565
Open
SS-JIA wants to merge 1 commit into gh/SS-JIA/439/base
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17565
Note: Links to docs will display an error until the docs builds have been completed.
❌ 4 new failures as of commit 3a94e2e with merge base 7b843e4.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
manuelcandales approved these changes on Feb 19, 2026
Stack from ghstack (oldest at bottom):
Add a new q8ta_linear operator that performs fully quantized int8
linear (matmul + bias) with per-tensor activation quantization and
per-channel weight quantization, producing int8 output. This enables
back-to-back quantized linear layers without intermediate
dequantize/quantize steps.
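For reference, here is a minimal NumPy sketch of the computation that q8ta_linear fuses (dequantize → linear → quantize). The function name, argument layout, and rounding mode are illustrative assumptions, not the actual ET-VK API:

```python
import numpy as np

def q8ta_linear_ref(x_q, x_scale, x_zp,     # int8 input [M, K], per-tensor quant params
                    w_q, w_scales,          # int8 weight [N, K], per-channel scales [N]
                    bias,                   # float bias [N]
                    out_scale, out_zp):     # per-tensor output quant params
    """Unfused reference: dequantize activations/weights, linear, requantize."""
    x = (x_q.astype(np.float32) - x_zp) * x_scale     # per-tensor activation dequant
    w = w_q.astype(np.float32) * w_scales[:, None]    # per-channel weight dequant
    y = x @ w.T + bias                                # matmul + bias
    y_q = np.round(y / out_scale) + out_zp            # requantize
    return np.clip(y_q, -128, 127).astype(np.int8)    # int8 output
```

The fused operator produces the same int8 result directly from int8 inputs, so no float intermediate tensor has to be materialized between back-to-back quantized linear layers.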
The operator reuses the existing tiled int8 linear GLSL headers
(input/weight tile loading, int8 dot product accumulation, weight
scales/sums/bias loading) and adds output quantization via
quantize_and_pack to produce packed int8 output.
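The quantize_and_pack step itself lives in the GLSL compute shader; as a rough illustration only, assuming the common layout where four int8 values are packed into one 32-bit word (little-endian byte order), the idea looks like this in Python:

```python
import numpy as np

def quantize_and_pack_sketch(vals, scale, zp):
    """Illustrative sketch, not the shader code: quantize 4 float accumulator
    values to int8 and pack them into a single 32-bit word."""
    q = np.clip(np.round(np.asarray(vals) / scale) + zp, -128, 127).astype(np.int8)
    b = q.view(np.uint8).astype(np.uint32)            # reinterpret int8 bytes
    return int(b[0] | (b[1] << 8) | (b[2] << 16) | (b[3] << 24))
```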
The fusion pass in quantized_linear.py detects the
q→dq→linear→q pattern (where the output quantize node comes from a
subsequent quantized op's input) and fuses it into a single
q8ta_linear call.
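A hedged sketch of the matching logic, assuming a torch.fx-style graph: the op target names below are placeholders, and the real pass in quantized_linear.py matches the actual ExecuTorch quantize/dequantize targets:

```python
def find_q8ta_linear_patterns(graph):
    """Collect (input_q, dq, linear, output_q) tuples for the
    q -> dq -> linear -> q pattern. Target names are illustrative placeholders."""
    matches = []
    for linear_node in graph.nodes:
        if linear_node.target != "linear":                       # placeholder target
            continue
        dq = linear_node.args[0]
        if getattr(dq, "target", None) != "dequantize_per_tensor":
            continue
        q_in = dq.args[0]
        if getattr(q_in, "target", None) != "quantize_per_tensor":
            continue
        # The output quantize node is the input quantize of a subsequent quantized op.
        out_q = next((u for u in linear_node.users
                      if u.target == "quantize_per_tensor"), None)
        if out_q is not None:
            matches.append((q_in, dq, linear_node, out_q))
    return matches
```

Each match can then be rewritten into a single q8ta_linear call whose inputs are the original int8 activation and the per-channel quantized weight, and whose output quant params come from the detected output quantize node.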
This diff was authored with Claude.
Differential Revision: [D93768642](https://our.internmc.facebook.com/intern/diff/D93768642/)