Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

using mclapply within rcolgem locking up Kamphir #29

Open
ArtPoon opened this issue May 1, 2015 · 11 comments
Open

using mclapply within rcolgem locking up Kamphir #29

ArtPoon opened this issue May 1, 2015 · 11 comments

Comments

@ArtPoon
Copy link
Owner

ArtPoon commented May 1, 2015

This problem seems to be specific to OS-X. I haven't reproduced it on a Linux cluster. Monitoring PIDs, I can see that the Python threads are constant throughout the run, but new threads are constantly being spawned by rcolgem. Eventually this locks up the run.

@ArtPoon
Copy link
Owner Author

ArtPoon commented May 1, 2015

Maybe this has something to do with default limits on number of processes in OS-X?
ulimit is set to unlimited, but launchctl limit has maxproc set to 1064. Might this be causing Kamphir to lock up?

@ArtPoon ArtPoon closed this as completed in bf1404f May 1, 2015
@ArtPoon
Copy link
Owner Author

ArtPoon commented May 4, 2015

Still encountering problems in OS-X despite resolving the process ID usage issue.

@ArtPoon ArtPoon reopened this May 4, 2015
@rmcclosk
Copy link
Collaborator

rmcclosk commented May 5, 2015

The command that's locking up the program is

python kamphir.py DiffRisk settings.rcolgem-DiffRisk1.json rcolgem_c1-2.0_n-300_rho-0.9.nwk diffrisk-c1-2.log -kdecay 0.3 -tol0 0.005 -mintol 0.0025 -ncores 4 -nthreads 4 -nreps 20 -treenum 0

@rmcclosk
Copy link
Collaborator

rmcclosk commented May 5, 2015

This crashes on my machine. I get this error.

Error in unserialize(node$con) : error reading from connection
Error in serialize(data, node$con, xdr = FALSE) : 
  error writing to connection

The second two lines repeat about 30 times, and then I get a segfault message from Python.

@ArtPoon
Copy link
Owner Author

ArtPoon commented May 5, 2015

Interesting, I haven't encountered this one. It looks like a 'dill' related issue.

On May 5, 2015, at 9:48 AM, Rosemary McCloskey [email protected] wrote:

This crashes on my machine. I get this error.

Error in unserialize(node$con) : error reading from connection
Error in serialize(data, node$con, xdr = FALSE) :
error writing to connection

The second two lines repeat about 30 times, and then I get a segfault message from Python.


Reply to this email directly or view it on GitHub.

@ArtPoon
Copy link
Owner Author

ArtPoon commented May 5, 2015

Maybe it has something to do with user permissions on loading R libraries?

http://stackoverflow.com/questions/24583793/error-reading-from-connection-on-loading-package-on-unix

On May 5, 2015, at 9:50 AM, Art Poon [email protected] wrote:

Interesting, I haven't encountered this one. It looks like a 'dill' related issue.

On May 5, 2015, at 9:48 AM, Rosemary McCloskey [email protected] wrote:

This crashes on my machine. I get this error.

Error in unserialize(node$con) : error reading from connection
Error in serialize(data, node$con, xdr = FALSE) :
error writing to connection

The second two lines repeat about 30 times, and then I get a segfault message from Python.


Reply to this email directly or view it on GitHub.

@rmcclosk
Copy link
Collaborator

rmcclosk commented May 6, 2015

Reinstalling dill fixed the above. Unable to reproduce the hang on my workstation so far.

@ArtPoon
Copy link
Owner Author

ArtPoon commented May 6, 2015

Did you install a different version of dill? If so, please record the version number that caused this issue. If not, this is very odd.

@rmcclosk
Copy link
Collaborator

rmcclosk commented May 6, 2015

Um, I think it was a shared library problem due to different versions of python (possibly also of R) installed on my machine. I installed dill with pip and reinstalled R from source using the --enable-R-shlib option. I'm not sure which of those two things fixed the issue, but at any rate I have written them both down in the install documentation.

@ArtPoon
Copy link
Owner Author

ArtPoon commented May 6, 2015

Well, you need to compile R with --enable-R-shlib in order to get the rpy2 module to work. Sorry that's not documented :-(

@rmcclosk
Copy link
Collaborator

rmcclosk commented May 6, 2015

It is now, or anyway it will be once I make a pull request.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants