docs/backend/CANN.md: 4 additions & 0 deletions
@@ -314,3 +314,7 @@ Converting the matmul weight format from ND to NZ to improve performance. Enable
### GGML_CANN_ACL_GRAPH
Operators are executed using ACL graph execution, rather than in op-by-op (eager) mode. Enabled by default.
+
+### GGML_CANN_GRAPH_CACHE_CAPACITY
+
+Maximum number of compiled CANN graphs kept in the LRU cache. The default is 12. When the number of cached graphs exceeds this capacity, the least recently used graph is evicted.
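
To illustrate the eviction behaviour described for `GGML_CANN_GRAPH_CACHE_CAPACITY`, here is a minimal Python sketch of an LRU cache whose capacity comes from that environment variable. It is only a model of the policy, not the CANN backend's actual code. The names `GraphLRUCache` and `capacity_from_env` are hypothetical, and falling back to the default on an invalid or non-positive value is an assumption, not documented behaviour.

```python
import os
from collections import OrderedDict

DEFAULT_CAPACITY = 12  # default stated in the documentation


def capacity_from_env(env=os.environ):
    """Read the cache capacity from the environment (hypothetical helper)."""
    raw = env.get("GGML_CANN_GRAPH_CACHE_CAPACITY")
    if raw is None:
        return DEFAULT_CAPACITY
    try:
        value = int(raw)
    except ValueError:
        # Assumption: an unparsable value falls back to the default.
        return DEFAULT_CAPACITY
    # Assumption: a non-positive value also falls back to the default.
    return value if value > 0 else DEFAULT_CAPACITY


class GraphLRUCache:
    """Illustrative LRU cache; keys stand in for graph identifiers."""

    def __init__(self, capacity):
        self.capacity = capacity
        # Ordered from least recently used (front) to most recently used (back).
        self._graphs = OrderedDict()

    def get(self, key):
        if key not in self._graphs:
            return None
        self._graphs.move_to_end(key)  # a hit marks the graph as recently used
        return self._graphs[key]

    def put(self, key, graph):
        self._graphs[key] = graph
        self._graphs.move_to_end(key)
        # Evict least recently used entries once the capacity is exceeded.
        while len(self._graphs) > self.capacity:
            self._graphs.popitem(last=False)

    def keys(self):
        return list(self._graphs)


cache = GraphLRUCache(capacity=2)
cache.put("graph_a", "compiled A")
cache.put("graph_b", "compiled B")
cache.get("graph_a")                # graph_a is now the most recently used
cache.put("graph_c", "compiled C")  # exceeds capacity, so graph_b is evicted
```

In this model, a larger capacity keeps more compiled graphs alive at the cost of memory, while a smaller one leads to more frequent eviction and recompilation.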