Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use h5py for output data writing and consolidation to reduce memory footprint #10

Closed
wants to merge 2 commits into from

Conversation

thomas-a-neil
Copy link
Member

Building on CannyLab/rinokeras#12, the data consolidation step will read the entire output dataset into memory (which will crash for relatively small datasets if we include all encoder outputs, especially for the LSTM).

hdf5 allows us to iteratively write, and avoid the memory overhead of pickle

Upon reflection, some documentation update should probably be done as well, because I think we reference pickle a few time

@thomas-a-neil thomas-a-neil requested a review from rmrao August 8, 2019 23:49
@thomas-a-neil
Copy link
Member Author

This should also help with songlab-cal/tape#8

@rmrao
Copy link
Collaborator

rmrao commented Aug 25, 2019

Should we merge this? I don't think the rinokeras changes have been merged to master?

@thomas-a-neil
Copy link
Member Author

It depends on rinokeras changes, so I don't think we can merge it yet.

@rmrao
Copy link
Collaborator

rmrao commented Jan 7, 2020

Closing since both this and rinokeras are in basic maintenance mode now, so no major changes will be made.

@rmrao rmrao closed this Jan 7, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants