We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 71b5976 commit a4d6b38Copy full SHA for a4d6b38
src/common/include/macros.fpp
@@ -12,6 +12,10 @@
12
#endif
13
#:enddef
14
15
+! Caution:
16
+! This macro requires the use of a binding script to set CUDA_VISIBLE_DEVICES, such that we have one GPU device per MPI rank.
17
+! That's because for both cudaMemAdvise (preferred location) and cudaMemPrefetchAsync we use location = device_id = 0.
18
+! For an example see misc/nvidia_uvm/bind.sh.
19
#:def PREFER_GPU(*args)
20
#ifdef MFC_SIMULATION
21
#ifdef __NVCOMPILER_GPU_UNIFIED_MEM
0 commit comments