[GPTQ] Change actorder default to "static" #1425

Open · kylesayrs wants to merge 20 commits into main from kylesayrs/gptq-actorder-default
Conversation

kylesayrs (Collaborator) commented May 12, 2025

Purpose

  • Use best defaults for GPTQ quantization

Prerequisites

Changes

  • Set gptq actorder default to "static"

Testing

  • Ran llama w4a16 example to completion and validated the correct activation ordering
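
For illustration, a minimal sketch of a W4A16 GPTQ run that relies on the new default. The model and dataset names are placeholders, and the exact `oneshot`/`GPTQModifier` signatures may differ across llm-compressor versions:

```python
# Minimal sketch, not part of this PR; model and dataset are placeholders.
from llmcompressor import oneshot
from llmcompressor.modifiers.quantization import GPTQModifier

recipe = GPTQModifier(
    targets="Linear",
    scheme="W4A16",
    ignore=["lm_head"],
    # actorder is intentionally omitted: with this change it resolves to
    # "static" rather than the old default of None
)

oneshot(
    model="meta-llama/Meta-Llama-3-8B-Instruct",
    dataset="open_platypus",
    recipe=recipe,
    max_seq_length=2048,
    num_calibration_samples=512,
)
```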

dsikka (Collaborator) commented May 13, 2025

nice!

markurtz (Collaborator)

@kylesayrs quick question from my side. Since the old default was None, do we risk defaulting to an incorrect value for older recipes that don't include it? Especially for ones that specified it in the quantization scheme?

kylesayrs changed the base branch from kylesayrs/gptq-actorder to main May 15, 2025 04:40
kylesayrs dismissed brian-dellabetta’s stale review May 15, 2025 04:40

The base branch was changed.

kylesayrs changed the base branch from main to kylesayrs/gptq-actorder May 15, 2025 04:40
kylesayrs (Collaborator, Author)

@markurtz

  1. "older" recipes which did not specify will now use "static". This is (more or less) inevitable and imho acceptable, since a user which does not specify actorder probably does not care and just wants the best configuration
  2. For recipes which specify anything except "static", they will see this error encouraging them to modify their recipe to disable global actorder
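
A hypothetical sketch of the resolution behavior described above; the enum, function name, and error message are illustrative, not the actual llm-compressor source:

```python
from enum import Enum
from typing import Optional

class ActivationOrdering(str, Enum):  # illustrative subset of values
    STATIC = "static"
    GROUP = "group"
    WEIGHT = "weight"

def resolve_actorder(recipe_value: Optional[str]) -> str:
    # Case 1: the recipe does not specify actorder -> use the new default
    if recipe_value is None:
        return ActivationOrdering.STATIC.value
    # Recipes already requesting "static" are unaffected
    if recipe_value == ActivationOrdering.STATIC.value:
        return recipe_value
    # Case 2: the recipe specifies a conflicting value -> raise an error
    # that points the user at modifying their recipe
    raise ValueError(
        f"Recipe specifies actorder={recipe_value!r}, which conflicts with "
        "the global default 'static'. Modify the recipe to disable the "
        "global actorder default or set actorder explicitly."
    )
```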

rahul-tuli previously approved these changes May 16, 2025
Base automatically changed from kylesayrs/gptq-actorder to main May 19, 2025 16:39
kylesayrs added a commit that referenced this pull request May 19, 2025
## Purpose ##
* Make actorder option more intuitive for users
* Enable easier adjustment of actorder default #1425
* This change is conceptually intuitive because activation ordering is a concept that only applies to the GPTQ algorithm (the only algorithm for which quantization group order matters)

## Changes ##
* Add `actorder` argument to `GPTQModifier`
* Override `resolve_quantization_config` method to resolve config groups with the `actorder` argument
* (Misc) Rearrange method order to match the typical order in which they are called in the modifier lifecycle

## Testing ##
* Ran llama w4a16 example to completion

Signed-off-by: Kyle Sayers <[email protected]>
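
A hypothetical sketch of the resolution step described in the commit message; the dataclasses below stand in for compressed-tensors' config-group types rather than reproducing them:

```python
from dataclasses import dataclass
from typing import Dict, Optional

@dataclass
class WeightArgs:                      # stand-in for QuantizationArgs
    strategy: str = "group"
    actorder: Optional[str] = None

@dataclass
class ConfigGroup:                     # stand-in for a quantization scheme
    weights: Optional[WeightArgs] = None

def apply_modifier_actorder(
    groups: Dict[str, ConfigGroup], actorder: Optional[str]
) -> Dict[str, ConfigGroup]:
    """Stamp the modifier-level actorder onto each grouped-weight config."""
    if actorder is None:
        return groups
    for group in groups.values():
        # Activation ordering only applies to group-strategy weight
        # quantization, the case relevant to GPTQ
        if group.weights is not None and group.weights.strategy == "group":
            group.weights.actorder = actorder
    return groups

# Usage: a recipe group with no actorder picks up the modifier's value
groups = {"group_0": ConfigGroup(weights=WeightArgs())}
apply_modifier_actorder(groups, "static")
assert groups["group_0"].weights.actorder == "static"
```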
kylesayrs dismissed stale reviews from rahul-tuli and brian-dellabetta May 19, 2025 16:39

The base branch was changed.

Signed-off-by: Kyle Sayers <[email protected]>
kylesayrs force-pushed the kylesayrs/gptq-actorder-default branch from 8cf408e to 8b9f795 on May 20, 2025 20:58
kylesayrs changed the base branch from main to kylesayrs/fix-default-actorder May 20, 2025 20:58
kylesayrs added 3 commits May 20, 2025 17:07
Signed-off-by: Kyle Sayers <[email protected]>
Signed-off-by: Kyle Sayers <[email protected]>
Signed-off-by: Kyle Sayers <[email protected]>
Base automatically changed from kylesayrs/fix-default-actorder to main May 22, 2025 18:09
kylesayrs added a commit that referenced this pull request May 22, 2025
## Purpose ##
* Fix the false assumption that the `actorder` field is of enum type
* Although actorder passes through a [field_validator](https://github.com/neuralmagic/compressed-tensors/blob/main/src/compressed_tensors/quantization/quant_args.py#L200), `QuantizationArgs` has the [use_enum_values](https://github.com/neuralmagic/compressed-tensors/blob/main/src/compressed_tensors/quantization/quant_args.py#L128) configuration set, meaning that enum values are converted to strings
* This was done in relation to [this fix](neuralmagic/sparseml#2327)
* Remove the conflict with recipes which manually specify activation ordering by using a sentinel value

## Follow ups ##
* #1425

## Testing ##
* Ran llama3 example with manually specified `actorder=group`

---------

Signed-off-by: Kyle Sayers <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
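
A minimal demonstration of the pydantic behavior the commit describes; the model below stands in for compressed-tensors' `QuantizationArgs`, and the enum is an illustrative subset:

```python
from enum import Enum
from typing import Optional

from pydantic import BaseModel, ConfigDict

class ActivationOrdering(str, Enum):  # illustrative subset of values
    GROUP = "group"
    STATIC = "static"

class QuantArgs(BaseModel):           # stand-in for QuantizationArgs
    model_config = ConfigDict(use_enum_values=True)
    actorder: Optional[ActivationOrdering] = None

args = QuantArgs(actorder=ActivationOrdering.GROUP)
# use_enum_values stores the enum's *value*, so the field is a plain str
assert args.actorder == "group"
assert not isinstance(args.actorder, ActivationOrdering)

# Hence code must not assume args.actorder is an ActivationOrdering; the
# follow-up also uses a sentinel (rather than None) as the modifier default
# so that "unset" can be told apart from an explicitly specified value.
```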
kylesayrs changed the base branch from main to kylesayrs/actorder-test May 29, 2025 18:23
kylesayrs requested a review from rahul-tuli May 29, 2025 18:29
rahul-tuli previously approved these changes May 30, 2025
kylesayrs (Collaborator, Author)

Waiting for next weekly to run before merging

Base automatically changed from kylesayrs/actorder-test to main May 30, 2025 04:57
kylesayrs dismissed stale reviews from rahul-tuli and brian-dellabetta May 30, 2025 04:57

The base branch was changed.

dsikka (Collaborator) left a comment

FYI this will incorrectly save models for our e2e testing:
https://github.com/vllm-project/llm-compressor/tree/main/tests/e2e/vLLM/configs (specifically, it essentially removes the cases where there is no act order while duplicating some cases)

Same for lm-eval tests:
https://github.com/vllm-project/llm-compressor/tree/main/tests/lmeval/configs

kylesayrs (Collaborator, Author) commented Jun 26, 2025

@dsikka

> specifically, essentially remove the cases where there is no act order while duplicating some cases

Turning all non-specified actorder cases into actorder cases is the intention of this PR.

As for duplicating tests, I don't think any are actually duplicated? For a duplication to occur, a config must conflict with an existing weight actorder config:

w4a16_actorder_weight.yaml
w4a16_actorder_weight_qwen.yaml

vl_w4a16_actorder_weight.yaml
w4a16_actorder_weight.yaml

These are all w4a16. If you grep for the non-specified w4a16 configs, you get:

w4a16_channel_quant.yaml  # channelwise, not duplicating
w4a16_channel_quant_qwen.yaml  # channelwise, not duplicating
w4a16_grouped_quant.yaml  # different calibration and dataset, not duplicating
w4a16_grouped_quant_asym_awq.yaml  # AWQ, not duplicating
w4a16_2of4_channel_quant.yaml  # 2of4, not duplicating
w4a16_2of4_grouped_quant.yaml  # 2of4, not duplicating

All of the other w4a16 configs test for different things, except perhaps w4a16_grouped_quant.yaml, which essentially tests the same thing but with a slightly different model and dataset.

dsikka (Collaborator) commented Jun 26, 2025

> w4a16_actorder_weight.yaml

  1. We lose all non-act order cases. We still need to test this in e2e and lm-eval
  2. We need to update the naming in the model produced/pushed to nm-testing so that we indicate act order being present
  3. We have two identical lm-eval cases?

tests/lmeval/configs/w4a16_actorder_weight.yaml and tests/lmeval/configs/w4a16_grouped_quant.yaml

kylesayrs (Collaborator, Author)

> We lose all non-act order cases. We still need to test this in e2e and lm-eval

Just posted a run above.

> We need to update the naming in the model produced/pushed to nm-testing so that we indicate act order being present

Since actorder is now the default, we do not need to include it in model names; the convention is that default arguments are omitted from the model name.

> We have two identical lm-eval cases?

As I mentioned previously, these two tests are slightly different. After the above nightly finishes, I'll delete tests/lmeval/configs/w4a16_actorder_weight.yaml.

dsikka (Collaborator) commented Jun 27, 2025

> We lose all non-act order cases. We still need to test this in e2e and lm-eval
>
> Just posted a run above.
>
> We need to update the naming in the model produced/pushed to nm-testing so that we indicate act order being present
>
> Since actorder is now the default, we do not need to include it in model names; the convention is that default arguments are omitted from the model name.
>
> We have two identical lm-eval cases?
>
> As I mentioned previously, these two tests are slightly different. After the above nightly finishes, I'll delete tests/lmeval/configs/w4a16_actorder_weight.yaml.

This is for our internal testing. Please keep the naming convention until this case has been adopted as our default.

And as I mentioned, we still need to test a case with no act order.
