Deprecating Int8DynActInt4WeightQuantizer #1332

jerryzh168 · 2024-10-28T18:35:38Z

Summary:
Added torchao API int8_dynamic_activation_int4_weight I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile.

But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments

Specifically there were some issues when I'm trying to run the following after I installed executorch from pip

python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte

Error:

Traceback (most recent call last):
  File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module>
    from torchchat.cli.cli import (
  File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module>
    import torch
  File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module>
    from torch._C import *  # noqa: F403
ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12

Test Plan:
test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: Added torchao API int8_dynamic_activation_int4_weight I ran some benchmark with eager mode and there was some accuracy loss and some slowdowns in compile. But later I found we only care about performance on executorch. So was trying to benchmark et perf and accuracy. but I can't get the env setup correctly for executorch to run the experiments Specifically there were some issues when I'm trying to run the following after I installed executorch from pip ``` python3 torchchat.py export llama3.1 --quantize torchchat/quant_config/mobile.json --output-pte-path llama3.1.pte ``` Error: ``` Traceback (most recent call last): File "/data/users/jerryzh/torchchat/torchchat.py", line 17, in <module> from torchchat.cli.cli import ( File "/data/users/jerryzh/torchchat/torchchat/cli/cli.py", line 14, in <module> import torch File "/home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/__init__.py", line 368, in <module> from torch._C import * # noqa: F403 ImportError: /home/jerryzh/.conda/envs/torchchat/lib/python3.10/site-packages/torch/lib/../../nvidia/cusparse/lib/libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12 ``` Test Plan: test instruction from Jack: https://docs.google.com/document/d/1eRAoY1Jq4SR5A7iAYC71maSZAPzsBJmr9VuZPhR5ZYA/edit?tab=t.0#bookmark=id.otk3jomaciya Reviewers: Subscribers: Tasks: Tags:

pytorch-bot · 2024-10-28T18:35:42Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1332

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Jack-Khuu · 2024-12-09T21:22:32Z

@vmpuri jic you missed this

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 28, 2024

Jack-Khuu requested a review from vmpuri December 9, 2024 21:22

Jack-Khuu added the Quantization Issues related to Quantization or torchao label Dec 18, 2024

jerryzh168 closed this Aug 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Deprecating Int8DynActInt4WeightQuantizer #1332

Deprecating Int8DynActInt4WeightQuantizer #1332

Uh oh!

jerryzh168 commented Oct 28, 2024

Uh oh!

pytorch-bot bot commented Oct 28, 2024

Uh oh!

Jack-Khuu commented Dec 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Deprecating Int8DynActInt4WeightQuantizer #1332

Deprecating Int8DynActInt4WeightQuantizer #1332

Uh oh!

Conversation

jerryzh168 commented Oct 28, 2024

Uh oh!

pytorch-bot bot commented Oct 28, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1332

Uh oh!

Jack-Khuu commented Dec 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants