
Conversation

@brian-dellabetta
Collaborator

SUMMARY:
We want to give users the ability to disable quantization in AWQModifier, so that the best scales are found and applied to the weights without round-to-nearest quantization. This is useful when someone wants to run AWQ followed by GPTQ before ultimately quantizing the weights. This is a draft solution; see #1972 for discussion.
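As a rough illustration, here is a hypothetical recipe sketch showing AWQ applying its scales without quantizing, followed by GPTQ doing the actual weight quantization. The disable_quantization flag is the proposed addition from this draft (per #1972) and is not a released API; the model ID, dataset, and other arguments are placeholders.

```python
# Hypothetical sketch based on this draft and discussion #1972.
# `disable_quantization` is the proposed flag, not an existing option.
from transformers import AutoModelForCausalLM

from llmcompressor import oneshot
from llmcompressor.modifiers.awq import AWQModifier
from llmcompressor.modifiers.quantization import GPTQModifier

MODEL_ID = "meta-llama/Llama-3.2-1B-Instruct"  # placeholder model
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto")

recipe = [
    # AWQ searches for the best smoothing scales and folds them into the
    # weights, but (with the proposed flag) skips round-to-nearest
    # quantization of the weights themselves.
    AWQModifier(
        scheme="W4A16",
        targets=["Linear"],
        ignore=["lm_head"],
        disable_quantization=True,  # proposed flag from this PR
    ),
    # GPTQ then performs the actual quantization on the AWQ-adjusted weights.
    GPTQModifier(scheme="W4A16", targets=["Linear"], ignore=["lm_head"]),
]

oneshot(
    model=model,
    dataset="open_platypus",  # placeholder calibration dataset
    recipe=recipe,
    max_seq_length=2048,
    num_calibration_samples=256,
)
```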

TEST PLAN:
"please outline how the changes were tested"

@github-actions

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite; please only add the label once the PR is code complete and local testing has been performed.

@zhanglei1172
Contributor

Currently, if you only want the AWQ scales to take effect and do not want to perform the quantization process, you can simply set save_compressed=False in the model.save_pretrained method (the scale and zero-point parameters may need to be manually deleted).
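For clarity, a minimal sketch of that workaround, assuming a model that has already gone through oneshot AWQ: save_compressed=False keeps the weights in their unquantized dtype, and the parameter names weight_scale / weight_zero_point are assumptions about how the quantization parameters are attached, so adjust them to what the checkpoint actually contains.

```python
# Sketch of the manual workaround described above; parameter names are
# assumptions and may differ depending on the quantization scheme.
model.save_pretrained(
    "awq-scales-only",
    save_compressed=False,  # keep weights uncompressed, scales already folded in
)

# Optionally strip leftover quantization parameters from the modules.
for module in model.modules():
    for name in ("weight_scale", "weight_zero_point"):
        if hasattr(module, name):
            delattr(module, name)
```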

@brian-dellabetta
Collaborator Author

Currently, if you only want the AWQ scales to take effect and do not want to perform the quantization process, you can simply set save_compressed=False in the model.save_pretrained method (the scale and zero-point parameters may need to be manually deleted).

Hi @zhanglei1172, yes, that's probably true. But it would require some manual overhead for a user who wants to do that, whereas this could just be a single boolean flag, disable_quantization, based on discussion #1972. I think it would amount to the same thing: scales and zero points are just not created, and quantization configs and statuses are pruned from the target modules, so the checkpoint at the end of the pipeline would just have modified weights.
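As a quick sanity check of that behavior, one could confirm that no quantization parameters survive on the model when the flag is enabled (a sketch, assuming scale and zero-point parameters follow the usual "_scale" / "_zero_point" naming):

```python
# Sketch: verify the model carries only modified weights, no quantization params.
leftover = [
    name
    for name, _ in model.named_parameters()
    if name.endswith(("_scale", "_zero_point"))
]
assert not leftover, f"unexpected quantization parameters: {leftover}"
```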
