Skip to content

Commit 14e2b07

Browse files
[BugFix] fix CUTLASS MLA full cudagraph (#23200)
Signed-off-by: Lucas Wilkinson <[email protected]>
1 parent 0f4f019 commit 14e2b07

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/v1/attention/backends/mla/cutlass_mla.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@
2121

2222
class CutlassMLAMetadataBuilder(MLACommonMetadataBuilder[MLACommonMetadata]):
2323
# enable full CUDA Graph support for decode-only capture
24-
attn_cudagraph_support: ClassVar[
24+
cudagraph_support: ClassVar[
2525
AttentionCGSupport] = AttentionCGSupport.UNIFORM_SINGLE_TOKEN_DECODE
2626

2727

0 commit comments

Comments
 (0)