__fenix_repair_ranks could hang due to permanent failure to repair communicators #27

Open
rfvander opened this issue Mar 30, 2017 · 0 comments

@rfvander
Contributor

The whole program could hang if some of the repair operations never succeed, because the loop “while (!repair_success) {“ has no exit condition other than success. I think I remarked earlier that there is a stray variable num_try that gets incremented but is not otherwise used by that function. Evidently, it was introduced to set some kind of limit on attempts to repair communicators. I think we should introduce an environment variable to govern the number of retries, with a default value set to a reasonably large number.
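
A rough sketch of what I have in mind, not the actual Fenix code. The environment variable name FENIX_MAX_REPAIR_TRIES, the default of 1000, and the helpers get_max_repair_tries, repair_with_limit, never_repairs, and repairs_on_third are all hypothetical; the callback only stands in for whatever a single repair attempt does inside __fenix_repair_ranks.

```c
#include <errno.h>
#include <limits.h>
#include <stdlib.h>

/* Hypothetical default: large enough that transient failures can recover. */
#define DEFAULT_MAX_REPAIR_TRIES 1000

/* Read the retry limit from an environment variable (name is an assumption).
 * Falls back to the default if it is unset or not a positive integer. */
static int get_max_repair_tries(const char *env_name)
{
    const char *value = getenv(env_name);
    if (value == NULL || *value == '\0')
        return DEFAULT_MAX_REPAIR_TRIES;

    char *end = NULL;
    errno = 0;
    long parsed = strtol(value, &end, 10);
    if (errno != 0 || *end != '\0' || parsed <= 0 || parsed > INT_MAX)
        return DEFAULT_MAX_REPAIR_TRIES;
    return (int)parsed;
}

/* Stand-in for one repair attempt; returns nonzero on success. */
typedef int (*repair_attempt_fn)(void *state);

/* Bounded version of "while (!repair_success)": num_try now limits the loop.
 * Returns 0 on success, -1 if max_tries attempts all failed. */
static int repair_with_limit(repair_attempt_fn attempt, void *state, int max_tries)
{
    int repair_success = 0;
    int num_try = 0;
    while (!repair_success && num_try < max_tries) {
        repair_success = attempt(state);
        num_try++;
    }
    return repair_success ? 0 : -1;
}

/* Demo attempts: one never succeeds, one succeeds on its third call. */
static int never_repairs(void *state) { (void)state; return 0; }
static int repairs_on_third(void *state)
{
    int *calls = state;
    return ++(*calls) >= 3;
}
```

A caller would do something like repair_with_limit(attempt, state, get_max_repair_tries("FENIX_MAX_REPAIR_TRIES")) and treat a return of -1 as a permanent failure to be reported, rather than spinning forever.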

@rfvander rfvander added the bug label Mar 30, 2017