Skip to content

Commit bc30deb

Browse files
committed
fix CI
Signed-off-by: elvischenv <[email protected]>
1 parent 2d1cf56 commit bc30deb

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

vllm/attention/backends/flashinfer.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1106,7 +1106,8 @@ def forward(
11061106
prefill_output: Optional[torch.Tensor] = None
11071107
if num_decode_tokens > 0:
11081108
decode_output = torch.empty(decode_query.shape,
1109-
dtype=decode_query.dtype)
1109+
dtype=decode_query.dtype,
1110+
device=decode_query.device)
11101111
else:
11111112
decode_output = None
11121113
stride_order = FlashInferBackend.get_kv_cache_stride_order()

0 commit comments

Comments
 (0)