Skip to content

Share kv-cache in multimethod pte #17496

@lucylq

Description

@lucylq

To reduce runtime memory when we have a multimethod PTE file, have each method point to the same kv-cache.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

Status

In progress

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions