Skip to content

Commit 502d4ee

Browse files
committed
be careful of teh routed_scaling_factor
1 parent 00d0877 commit 502d4ee

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/models/deepseek_v2.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -186,7 +186,7 @@ def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
186186
if hidden_states.dtype != torch.float16:
187187
final_hidden_states = self.experts(
188188
hidden_states=hidden_states,
189-
router_logits=router_logits) * self.routed_scaling_factor
189+
router_logits=router_logits)# * self.routed_scaling_factor
190190
else:
191191
# Fix FP16 overflow
192192
# See DeepseekV2DecoderLayer for more details.

0 commit comments

Comments
 (0)