Optimize AtomicNonlocal term for the GPU by abussy · Pull Request #1265 · JuliaMolSim/DFTK.jl

abussy · 2026-02-24T14:20:03Z

This PR optimizes the instantiation and force computation of the AtomicNonlocal term for the GPU, applying the same principles as #1163 and #1262. The largest performance gain is in stress and response calculations, where Duals are involved. Because the AtomicNonlocal term was never fully ported to the GPU, standard SCF and force calculations are also accelerated.

Optimize AtomicNonlocal term for the GPU

3403ae0

mfherbst mentioned this pull request Feb 26, 2026

Optimizing XC instantiation for GPUs #1262

Open

abussy wants to merge 1 commit into JuliaMolSim:master from abussy:nonlocal
