You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= PID 121038 RUNNING AT c405-013
= EXIT CODE: 11
= CLEANING UP REMAINING PROCESSES
= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
Intel(R) MPI Library troubleshooting guide:
https://software.intel.com/node/561764
===================================================================================
TACC: MPI job exited with code: 11
TACC: Shutdown complete. Exiting.
I believe it leads to other unexpected behaviors when executed in the middle of a more complicated program, such as taking over 30 minutes (and getting cancelled due to time limit) for such small tensors or the following error:
c405-041.stampede2.tacc.utexas.edu.75166Received eager message(s) ptype=0x1 opcode=0xcc from an unknown process (err=49)
The program behaved as expected using either 128 nodes 64 ppn or 256 nodes 64 ppn, however.
The text was updated successfully, but these errors were encountered:
The following code snippet failed with MPI exit code 11 on Stampede2 using 256 nodes and 64 ppn.
gives the following output:
I believe it leads to other unexpected behaviors when executed in the middle of a more complicated program, such as taking over 30 minutes (and getting cancelled due to time limit) for such small tensors or the following error:
The program behaved as expected using either 128 nodes 64 ppn or 256 nodes 64 ppn, however.
The text was updated successfully, but these errors were encountered: