Skip to content
This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Conversation

@jerryzh168
Copy link
Contributor

Summary:
Added torchao API int8_dynamic_activation_int4_weight I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile.

But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments

Specifically there were some issues when I'm trying to run the following after I installed executorch from pip

python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte

Error:

Traceback (most recent call last):
  File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module>
    from torchchat.cli.cli import (
  File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module>
    import torch
  File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module>
    from torch._C import *  # noqa: F403
ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12

Test Plan:
test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya

Reviewers:

Subscribers:

Tasks:

Tags:

Summary:
Added torchao API int8_dynamic_activation_int4_weight
I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile.

But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments

Specifically there were some issues when I'm trying to run the following after I installed executorch from pip
```
python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte
```

Error:
```
Traceback (most recent call last):
  File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module>
    from torchchat.cli.cli import (
  File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module>
    import torch
  File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module>
    from torch._C import *  # noqa: F403
ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12
```

Test Plan:
test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya

Reviewers:

Subscribers:

Tasks:

Tags:
@pytorch-bot
Copy link

pytorch-bot bot commented Oct 28, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1332

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 28, 2024
@Jack-Khuu Jack-Khuu requested a review from vmpuri December 9, 2024 21:22
@Jack-Khuu
Copy link
Contributor

@vmpuri jic you missed this

@Jack-Khuu Jack-Khuu added the Quantization Issues related to Quantization or torchao label Dec 18, 2024
@jerryzh168 jerryzh168 closed this Aug 11, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

CLA Signed This label is managed by the Meta Open Source bot. Quantization Issues related to Quantization or torchao

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants