Skip to content

Optimize AtomicNonlocal term for the GPU#1265

Open
abussy wants to merge 1 commit intoJuliaMolSim:masterfrom
abussy:nonlocal
Open

Optimize AtomicNonlocal term for the GPU#1265
abussy wants to merge 1 commit intoJuliaMolSim:masterfrom
abussy:nonlocal

Conversation

@abussy
Copy link
Copy Markdown
Collaborator

@abussy abussy commented Feb 24, 2026

This PR optimizes the AtomicNonlocal term instantiation and forces for the GPU. The same principles as #1163 and #1262 are applied. The largest performance impact is felt in stress and response calculations, when Duals are involved. However, since the AtomicNonlocal term was never fully ported to the GPU, standard SCF and forces calculations are also accelerated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant