Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Parallelisation #365

Open
daviesje opened this issue Mar 28, 2024 · 0 comments
Open

Improve Parallelisation #365

daviesje opened this issue Mar 28, 2024 · 0 comments
Labels
context: C backend Changes occur predominantly in the C code context: v4-prep This issue regards changes to the v4-prep branch priority: low type: performance: speed Changes affecting speed of calculations

Comments

@daviesje
Copy link
Contributor

The speedup from parallelisation in the current form of 21cmFAST drops significantly after ~4 cores. It would be nice to look into possible improvements so that we can scale better with larger boxes, or with some of the slower modes of running the model.

Some improvements could include:

  • Scheduling: The default scheduling allocates large chunks of the box to each thread, this may result in an imbalance since regions of similar density will be allocated to the same thread. Either reducing the chunk size or using dynamic scheduling for the more intensive parallel regions may help with this.
  • Parallel region structure: This will be more difficult to test (and will likely require a better profiling setup) but it's possible that splitting up some of the larger parallel loops into chunks of similar computation will help the balancing.

A more minor point on how OpenMP is written is that in the previously existing parts of the code, every variable used in a parallel region is explicitly declared as shared or private in the directive, whereas my additions rely on the default scoping, where all variables are assumed shared unless declared inside the parallel region. I wrote them like this since I believe it is more readable, however I would appreciate some input on which style is preferred by others, so that we can make it uniform across the package.

@daviesje daviesje added context: C backend Changes occur predominantly in the C code context: v4-prep This issue regards changes to the v4-prep branch priority: low type: performance: speed Changes affecting speed of calculations labels Mar 28, 2024
@daviesje daviesje moved this to Backlog in 21cmFAST v4 Apr 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
context: C backend Changes occur predominantly in the C code context: v4-prep This issue regards changes to the v4-prep branch priority: low type: performance: speed Changes affecting speed of calculations
Projects
Status: Backlog
Development

No branches or pull requests

1 participant