Improve Parallelisation #365

daviesje · 2024-03-28T11:06:46Z

The speedup from parallelisation in the current form of 21cmFAST drops significantly after ~4 cores. It would be nice to look into possible improvements so that we can scale better with larger boxes, or with some of the slower modes of running the model.

Some improvements could include:

Scheduling: The default scheduling allocates large chunks of the box to each thread, this may result in an imbalance since regions of similar density will be allocated to the same thread. Either reducing the chunk size or using dynamic scheduling for the more intensive parallel regions may help with this.
Parallel region structure: This will be more difficult to test (and will likely require a better profiling setup) but it's possible that splitting up some of the larger parallel loops into chunks of similar computation will help the balancing.

A more minor point on how OpenMP is written is that in the previously existing parts of the code, every variable used in a parallel region is explicitly declared as shared or private in the directive, whereas my additions rely on the default scoping, where all variables are assumed shared unless declared inside the parallel region. I wrote them like this since I believe it is more readable, however I would appreciate some input on which style is preferred by others, so that we can make it uniform across the package.

daviesje added context: C backend Changes occur predominantly in the C code context: v4-prep This issue regards changes to the v4-prep branch priority: low type: performance: speed Changes affecting speed of calculations labels Mar 28, 2024

daviesje added this to 21cmFAST v4 Mar 28, 2024

daviesje moved this to Backlog in 21cmFAST v4 Apr 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Parallelisation #365

Improve Parallelisation #365

daviesje commented Mar 28, 2024

Improve Parallelisation #365

Improve Parallelisation #365

Comments

daviesje commented Mar 28, 2024