File tree Expand file tree Collapse file tree 1 file changed +3
-2
lines changed
src/llmcompressor/modifiers/quantization/gptq Expand file tree Collapse file tree 1 file changed +3
-2
lines changed Original file line number Diff line number Diff line change @@ -76,8 +76,9 @@ class GPTQModifier(Modifier, QuantizationMixin):
76
76
:param block_size: Used to determine number of columns to compress in one pass
77
77
:param dampening_frac: Amount of dampening to apply to H, as a fraction of the
78
78
diagonal norm
79
- :param actorder: order in which weight columns are quantized. For more information,
80
- on actorder options, see https://github.com/vllm-project/vllm/pull/8135
79
+ :param actorder: order in which weight columns are quantized. Defaults to "static"
80
+ activation ordering, which achieves best accuracy recovery with no runtime cost.
81
+ For more information, see https://github.com/vllm-project/vllm/pull/8135
81
82
:param offload_hessians: Set to True for decreased memory usage but increased
82
83
runtime.
83
84
You can’t perform that action at this time.
0 commit comments