Skip to content

Conversation

noemotiovon
Copy link
Collaborator

Introduce a high performance mode for the CANN backend. In this mode, intermediate computation states are stored in FP16, which improves execution performance at the cost of slightly reduced precision.

Make sure to read the contributing guidelines before submitting a PR

Introduce a high performance mode for the CANN backend.
In this mode, intermediate computation states are stored in FP16,
which improves execution performance at the cost of slightly reduced precision.
@github-actions github-actions bot added documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Ascend NPU issues specific to Ascend NPUs labels Sep 25, 2025
@noemotiovon
Copy link
Collaborator Author

Further discussion at #16251

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Ascend NPU issues specific to Ascend NPUs documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant