Investigate why Dice command line application always returns non-zero exit codes #30

Open
caleb-johnson opened this issue Sep 25, 2024 · 0 comments

Collaborator

caleb-johnson commented Sep 25, 2024

Even on successful runs, Dice returns a non-zero exit code and gives an MPI error in the output. Investigate why this is happening and how we can get Dice to return 0 on successful execution.

**************************************************************
CALCULATING RDMs
**************************************************************
Error here
--------------------------------------------------------------------------
mpirun has exited due to process rank 1 with PID 0 on <MACHINE_NAME> exiting improperly. There are three reasons this could occur:

1. this process did not call "init" before exiting, but others in
the job did. This can cause a job to hang indefinitely while it waits
for all processes to call "init". By rule, if one process calls "init",
then ALL processes must call "init" prior to termination.

2. this process called "init", but exited without calling "finalize".
By rule, all processes that call "init" MUST call "finalize" prior to
exiting or it will be considered an "abnormal termination"

3. this process called "MPI_Abort" or "orte_abort" and the mca parameter
orte_create_session_dirs is set to false. In this case, the run-time cannot
detect that the abort call was an abnormal termination. Hence, the only
error message you will receive is this one.

This may have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).

You can avoid this message by specifying -quiet on the mpirun command line.
--------------------------------------------------------------------------
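As a starting point for the investigation, it may help to record the exit code and check the output for the mpirun banner in one place, so that different launch configurations can be compared. The sketch below is only an illustration: the helper names `classify_run` and `run_and_classify` are hypothetical, the marker strings are copied from the log above, and the actual Dice command line is an assumption left to the caller.

```python
import subprocess
import sys

# Marker strings copied from the log above.
MPIRUN_ABNORMAL_EXIT = "mpirun has exited due to process rank"
RDM_STAGE = "CALCULATING RDMs"


def classify_run(returncode, output):
    """Summarize a finished run from its exit code and combined output."""
    return {
        "returncode": returncode,
        "reached_rdm_stage": RDM_STAGE in output,
        "mpirun_abnormal_exit": MPIRUN_ABNORMAL_EXIT in output,
    }


def run_and_classify(cmd):
    """Run cmd (a list of strings), merging stderr into stdout, and classify it."""
    proc = subprocess.run(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        text=True,
    )
    return classify_run(proc.returncode, proc.stdout)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: pass the full launch command as arguments,
    # e.g. the mpirun invocation that is normally used to start Dice.
    print(run_and_classify(sys.argv[1:]))
```

Running the same input with and without `mpirun` and comparing the resulting dictionaries may show whether the non-zero code comes from Dice itself or only from mpirun's handling of a rank that exits without finalizing.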