Commit 8428e0c
authored
[Bugfix] Fix MTP support for lmhead_tensor_parallel_size (vllm-project#3915)
### What this PR does / why we need it?
Fix the issue of MTP being enabled and setting
Imhead_tensor_parallel_size=16 causing the inference to hang.
- vLLM version: v0.11.0
- vLLM main:
vllm-project/vllm@83f478b
Signed-off-by: wyh145 <[email protected]>
Signed-off-by: luolun <[email protected]>1 parent c497b9e commit 8428e0c
File tree
2 files changed
+3
-2
lines changed- vllm_ascend
- ops
- worker
2 files changed
+3
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
54 | | - | |
| 54 | + | |
55 | 55 | | |
56 | 56 | | |
57 | 57 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2913 | 2913 | | |
2914 | 2914 | | |
2915 | 2915 | | |
2916 | | - | |
| 2916 | + | |
| 2917 | + | |
2917 | 2918 | | |
2918 | 2919 | | |
2919 | 2920 | | |
| |||
0 commit comments