How do I evaluate the performance of a `.pte` file after Executorch quantization? #14988

cupid20103 · 2025-10-10T13:16:42Z

Hello everyone!

I'm currently working on a React Native project and want to integrate a Hugging Face model (wav2vec2) into my app.
As you know, to run a large model in a mobile environment, the model needs to be quantized.

So I used ExecuTorch to quantize it, which generated a .pte file.
Now I need to evaluate the performance of this file (PER, FER, RTF, peak RAM usage).

How can I measure these?

cccclai (Collaborator) · 2025-11-25T16:51:38Z

Sorry for the late reply. Do you mean that you want to measure accuracy on device? I think there are two ways: one is to bundle the test inputs and reference outputs with the model, and the other is to push the data to the device and pull the results back. cc: @Gasoonjia
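Whichever route is used, the check at the end is the same: compare what the device produced against a reference. Below is a minimal sketch of that comparison, assuming both outputs have already been read into flat lists of floats. The function name `compare_outputs` and the tolerance value are hypothetical, not part of ExecuTorch.

```python
from typing import Sequence, Tuple


def compare_outputs(
    reference: Sequence[float],
    actual: Sequence[float],
    atol: float = 1e-2,  # assumed tolerance; quantized models usually need a loose one
) -> Tuple[float, bool]:
    """Return (max absolute error, whether it is within atol)."""
    if len(reference) != len(actual):
        raise ValueError("reference and actual outputs differ in length")
    if not reference:
        raise ValueError("outputs are empty")
    max_abs_err = max(abs(r - a) for r, a in zip(reference, actual))
    return max_abs_err, max_abs_err <= atol
```

For a quantized model, small numeric drift is expected, so task-level metrics on a test set are usually more informative than an element-wise tolerance alone.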

ashiq-km · 2025-12-01T19:50:03Z

Hi! 👋

That’s a great question. I’m not an expert in wav2vec2 quantization, but here’s what I’d try if I were evaluating the .pte file on mobile:

Performance Metrics

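PER/FER: Feed some audio samples into the quantized model and compare its predictions with the ground truth. PER (Phoneme Error Rate) is the edit distance between the predicted and reference phoneme sequences divided by the reference length; FER (Frame Error Rate) is the fraction of frames whose predicted label differs from the reference. A small evaluation script can compute both.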
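A minimal sketch of such an evaluation script, assuming the model output has already been decoded into phoneme sequences (for PER) or per-frame labels (for FER). The function names are illustrative and not taken from any library.

```python
from typing import Hashable, Sequence


def edit_distance(ref: Sequence[Hashable], hyp: Sequence[Hashable]) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference symbol missing in hypothesis)
                curr[j - 1] + 1,     # insertion (extra symbol in hypothesis)
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def phoneme_error_rate(refs, hyps) -> float:
    """Total edits over total reference phonemes, across all utterances."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_ref = sum(len(r) for r in refs)
    if total_ref == 0:
        raise ValueError("reference set is empty")
    return total_edits / total_ref


def frame_error_rate(ref_frames: Sequence, hyp_frames: Sequence) -> float:
    """Fraction of frames whose predicted label differs from the reference."""
    if len(ref_frames) != len(hyp_frames) or not ref_frames:
        raise ValueError("frame sequences must be non-empty and equal in length")
    wrong = sum(r != h for r, h in zip(ref_frames, hyp_frames))
    return wrong / len(ref_frames)
```

Running the same script on the original float model and on the quantized one shows how much accuracy the quantization cost.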

RTF (Real-Time Factor): Measure the time the model takes to process an audio sample and divide it by the audio duration. An RTF below 1 means the model runs faster than real time.
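A minimal timing sketch for RTF. It assumes inference is wrapped in a zero-argument callable; `infer`, `warmup`, and `runs` are hypothetical names chosen for illustration.

```python
import time
from typing import Callable


def real_time_factor(
    infer: Callable[[], object],
    audio_seconds: float,
    warmup: int = 1,
    runs: int = 5,
) -> float:
    """Average wall-clock inference time divided by the audio duration."""
    if audio_seconds <= 0 or runs <= 0:
        raise ValueError("audio_seconds and runs must be positive")
    for _ in range(warmup):
        infer()  # warm-up runs are excluded from the measurement
    start = time.perf_counter()
    for _ in range(runs):
        infer()
    mean_latency = (time.perf_counter() - start) / runs
    return mean_latency / audio_seconds
```

The audio duration is the number of samples divided by the sample rate, so a 160,000-sample clip at 16 kHz is 10 seconds. Numbers measured on a desktop will differ from those on a phone, so the final figure should come from the target device.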

Peak RAM Usage: On Android, you can use the Android Studio Profiler; on iOS, Xcode Instruments can track memory usage while inference runs.
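For a rough sanity check on a development machine before profiling on device, the peak resident memory of a Python process can be read with the standard `resource` module (Unix only). This is a host-side sketch, not a measurement of the mobile app; `peak_rss_mib` is a hypothetical helper name.

```python
import resource
import sys


def peak_rss_mib() -> float:
    """Peak resident set size of the current process, in MiB."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor
```

Calling it once before loading the model and once after the evaluation loop gives an approximate upper bound on what inference added.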

Since ExecuTorch produced the .pte file, you’ll likely need to load it with the ExecuTorch runtime and loop over your test audio samples while logging time and memory.
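A sketch of such a loop using the ExecuTorch Python runtime bindings on a development machine. It assumes the exported method is named `forward` and takes a single waveform tensor, which may not match your export. `make_executorch_runner` and `run_eval` are hypothetical helper names.

```python
import time
from pathlib import Path
from typing import Any, Callable, Dict, Iterable, List, Tuple


def make_executorch_runner(pte_path: str, method_name: str = "forward") -> Callable[[Any], Any]:
    """Wrap a .pte program as a callable (assumes one input and one output)."""
    # Imported lazily so the evaluation loop below also works with other runners.
    from executorch.runtime import Runtime

    runtime = Runtime.get()
    program = runtime.load_program(Path(pte_path))
    method = program.load_method(method_name)

    def run(waveform: Any) -> Any:
        return method.execute([waveform])[0]

    return run


def run_eval(
    runner: Callable[[Any], Any],
    samples: Iterable[Tuple[str, Any, float]],
) -> List[Dict[str, Any]]:
    """Run each (id, waveform, audio_seconds) sample and log latency and RTF."""
    records = []
    for sample_id, waveform, audio_seconds in samples:
        start = time.perf_counter()
        output = runner(waveform)
        latency = time.perf_counter() - start
        records.append({
            "id": sample_id,
            "output": output,
            "latency_s": latency,
            "rtf": latency / audio_seconds,
        })
    return records
```

Usage would look like `run_eval(make_executorch_runner("model.pte"), samples)`, where `model.pte` is a placeholder path. The collected outputs can then be decoded and scored for PER or FER offline.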

Hugging Face has a guide on quantization (might help with scripts).

You can check torchaudio or transformers examples for evaluating WER/PER.

Hope this helps! I’d be curious to hear if you find a simple way to log all metrics together – I’m planning to try something similar in my own project.
