Heterogeneously run the LLaMA model on both the QNN and XNNPACK backends. #13629
@yujiaoliang For QNN specifically, you can instruct the QNN partitioner to skip specific node IDs or operators, which allows them to fall back to XNNPACK. See the QNN partitioner args here: https://www.internalfb.com/code/fbsource/[3369a2d3a668]/fbcode/executorch/backends/qualcomm/partition/qnn_partitioner.py?lines=135. You can then pass both the QnnPartitioner and the XnnpackPartitioner to to_edge_transform_and_lower; the second partitioner acts as a fallback:

```python
to_edge_transform_and_lower(
    ep,
    partitioner=[qnn_partitioner, xnnpack_partitioner],
)
```

You can also provide a custom partitioner for advanced use cases, but that requires a bit of coding. There is an example in https://docs.pytorch.org/executorch/main/compiler-delegate-and-partitioner.html#common-questions under "5. Can we delegate to multiple backends?".
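For reference, here is a minimal sketch of that flow. The `skip_node_op_set` parameter name, the op-name key, and the compiler-spec setup are assumptions based on the linked partitioner args and the Qualcomm backend docs; check the signatures in your ExecuTorch version.

```python
from executorch.backends.qualcomm.partition.qnn_partitioner import QnnPartitioner
from executorch.backends.xnnpack.partition.xnnpack_partitioner import XnnpackPartitioner
from executorch.exir import to_edge_transform_and_lower

# compiler_specs describe the target Qualcomm SoC; build them with the
# Qualcomm backend's compiler-spec helper for your device (omitted here).
compiler_specs = ...

# Ask the QNN partitioner to leave these ops untouched so the next
# partitioner in the list (XNNPACK) can claim them.
# NOTE: parameter name and op-name format are assumptions; see the
# partitioner args linked above.
qnn_partitioner = QnnPartitioner(
    compiler_specs,
    skip_node_op_set={"aten.linear.default"},
)
xnnpack_partitioner = XnnpackPartitioner()

# ep is the exported program from torch.export.export(model, example_inputs).
edge = to_edge_transform_and_lower(
    ep,
    partitioner=[qnn_partitioner, xnnpack_partitioner],
)
executorch_program = edge.to_executorch()
```

Order matters here: QNN claims nodes first, and anything it skips or cannot handle is picked up by the XNNPACK partitioner as the fallback.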
I’m planning to deploy the quantized LLaMA 3.2-3B model on QNN and run some of its linear layers on XNNPACK. Would this be possible, and is this kind of setup supported at the moment?