Commit 7e83993

fix(lint): fix lint
Signed-off-by: zhoux77899 <[email protected]>
1 parent af72a56 commit 7e83993

1 file changed: +2 -2 lines changed


vllm_ascend/quantization/w8a8_dynamic.py

Lines changed: 2 additions & 2 deletions
@@ -199,7 +199,7 @@ def apply_mlp(hidden_states: torch.Tensor,
 group_list = torch.cat(
     [group_list[:1], torch.diff(group_list, dim=0)])
 group_list_type = 1
-bias1 = w1_scale_bias
+bias1 = [w1_scale_bias] if is_torchair else w1_scale_bias
 bias2 = [w2_scale_bias]
 # TODO w4a8 scene: dynamic acquisition of dtype in the future
 _output_dtype = torch.bfloat16
@@ -219,7 +219,7 @@ def apply_mlp(hidden_states: torch.Tensor,
 x=[hidden_states],
 weight=[w1],
 scale=[w1_scale.to(w2_scale.dtype)],
-bias=[bias1] if isinstance(bias1, torch.Tensor) else bias1,
+bias=bias1,
 per_token_scale=[pertoken_scale],
 split_item=2,
 group_list_type=group_list_type,
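
Note on the change: the two hunks move the list-wrapping of bias1 from the call site (an isinstance check) to the assignment (an is_torchair check). The sketch below compares the two styles in isolation. It is not the code from w8a8_dynamic.py: `Tensor` is a stand-in for torch.Tensor so the snippet runs without torch, and the helper names `wrap_at_call_site` and `wrap_at_assignment` are hypothetical. It assumes, without confirmation from the diff, that w1_scale_bias is a bare tensor on the torchair path and already a list otherwise.

```python
class Tensor:
    """Stand-in for torch.Tensor (assumption: keeps the sketch torch-free)."""


def wrap_at_call_site(bias1):
    # Old style: wrap only when a bare tensor reaches the call.
    return [bias1] if isinstance(bias1, Tensor) else bias1


def wrap_at_assignment(w1_scale_bias, is_torchair):
    # New style: decide once, at the point where bias1 is assigned.
    return [w1_scale_bias] if is_torchair else w1_scale_bias


t = Tensor()

# Under the stated assumption both styles produce the same list.
same_torchair = wrap_at_call_site(t) == wrap_at_assignment(t, True)
same_default = wrap_at_call_site([t]) == wrap_at_assignment([t], False)

# They differ if a bare tensor arrives with is_torchair=False:
# the old style wraps it, the new style passes it through unchanged.
passthrough = wrap_at_assignment(t, False)
```

If the assumption about the two paths does not hold, the last case is where behavior would change, so it is the one worth checking against the callers of apply_mlp.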
