Replies: 1 comment
-
Our OpenMP strategy is explained in this paper, https://arxiv.org/abs/1604.03570. As for the |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, how does the coarse-grained OpenMP parallelism work in AMReX? This is with reference to the GPU/CNS tutorial where Make.CNS defines AMReX as using a coarse-grained OpenMP approach.
From the documentation, I understand that when compiling with MPI+OpenMP, each MPI rank/processor gets its own DistributionMapping (which may have multiple MFIter loops) and OpenMP threads are spawned on each MPI rank to run MFIter loops in parallel. This appears to be fine-grained parallelism on OpenMP.
How is a coarse-grained OpenMP approach (by setting 'DEFINES += -DAMREX_CRSEGRNDOMP' ) turn out to be different from the fine-grained approach? Further, what would be the best approach to achieve maximum compute performance for a hybrid MPI+OpenMP code?
I would really appreciate your thoughts on this. Thanks in advance
Beta Was this translation helpful? Give feedback.
All reactions