This repository has been archived by the owner on Mar 22, 2021. It is now read-only.

Replicability in parallelized code #38

Open
rgreminger opened this issue Nov 16, 2020 · 0 comments

Hi,

Another potentially useful addition to the page would be resources on how to ensure replicability when code runs in parallel. Specifically, when a function that draws random numbers is run many times in parallel (e.g. independent MC simulations), the draws can differ depending on the number of cores the code runs on, unless the seed is set in a specific way (e.g. in R, calling set.seed(2) alone is not enough).
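
To make this concrete, here is a minimal sketch of one common remedy: give each simulation its own generator, seeded from a base seed plus the task index, so the draws depend on which task runs and not on which worker runs it. It is written in Python rather than R and uses only the standard library; the names BASE_SEED, run_simulation and run_all are hypothetical, and threads stand in for cores to keep the example short. Deriving seeds this way makes results reproducible, but it is an assumption of the sketch, not a guarantee, that the resulting streams are statistically independent enough for a given application.

```python
import random
from concurrent.futures import ThreadPoolExecutor

BASE_SEED = 2  # hypothetical base seed, echoing set.seed(2) above


def run_simulation(task_id, n_draws=1000):
    """One Monte Carlo replication that owns its random generator."""
    # Seed from (base seed, task index), never from the worker that
    # happens to execute the task. String seeds are hashed
    # deterministically by random.Random.
    rng = random.Random(f"{BASE_SEED}-{task_id}")
    return sum(rng.random() for _ in range(n_draws)) / n_draws


def run_all(n_tasks, n_workers):
    """Run all replications on a pool of n_workers workers."""
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        # map() returns results in task order, regardless of scheduling.
        return list(pool.map(run_simulation, range(n_tasks)))


if __name__ == "__main__":
    # Same results with 1 worker and with 4 workers.
    print(run_all(8, 1) == run_all(8, 4))
```

Because no generator state is shared between tasks, changing the number of workers changes only the scheduling, not the numbers drawn. Seeding once globally and letting workers draw from whatever state they inherit is the pattern that breaks.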

I'm not sure how widely recognized this issue is (I've seen replication packages that ignore it completely, which led to different results when I ran the code on my machine), so simply highlighting it may already be useful. Let me know what you think; if you agree this would be useful, I (or someone else) can add a section on it at some point.

Best,
Rafael
