Switch output writing to h5py to reduce memory footprint. Update version #12

thomas-a-neil · 2019-08-08T23:47:35Z

Collecting all data and then pickling can crash a machine with limited memory.

e.g. a 20GB results set (held in memory) will crash a 32GB machine. This is partially due to the memory overhead of pickling an object. Writing to hdf5 allows us to write iteratively, reducing the memory footprint overall, and avoiding the pickling overhead.

DavidMChan · 2019-08-09T00:26:42Z

Can we change this so that the behavior remains the same for code that already exists - and add a new option, "save_format" with a default to "pkl"? This would break some code that already exists - as it relies on the outputs in a pkl format. Also, I've changed this branch to develop - but that might require some different commits - it looks like some things are changed that should not be, when merging.

DavidMChan · 2019-08-09T00:27:01Z

Whoops, on my phone - closed this by accident.

thomas-a-neil · 2019-08-09T03:08:24Z

That makes sense. I'll add the flag and rebase to develop

codecov-io · 2019-08-09T20:57:52Z

Codecov Report

Merging #12 into develop will decrease coverage by 0.14%.
The diff coverage is 5.55%.

@@             Coverage Diff             @@
##           develop      #12      +/-   ##
===========================================
- Coverage     65.6%   65.46%   -0.15%     
===========================================
  Files          143      143              
  Lines         6240     6255      +15     
===========================================
+ Hits          4094     4095       +1     
- Misses        2146     2160      +14

Impacted Files	Coverage Δ
rinokeras/core/v1x/train/RinokerasGraph.py	`27.7% <5.55%> (-2.38%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f61c48a...0927555. Read the comment docs.

thomas-a-neil · 2019-08-09T21:00:58Z

rinokeras/core/v1x/train/RinokerasGraph.py

+                    while True:
+                        loss, outputs = self.run('default', return_outputs=True)
+                        grp = f.create_group(str(i))
+                        outputs = outputs[0]  # can we rely on this being a tuple of length 1?


@DavidMChan I'm not sure about this, but can I rely on the run output always being a tuple of length 1?

Effectively, the outputs are the forward pass of your model. This means that "outputs" can be whatever you want it to be (a numpy array, a dict of numpy arrays, a tuple of arrays, etc.) This is probably why @rmrao hijacked it for use in TAPE. It also makes it tricky to write a generic saving function for the outputs since you have no guarantees on the data format. You can know, however, that the outputs will be the result of a forward run of the model (so they are convertible to tensors).

Perhaps it makes sense instead of adding options, to add a callback function? Not entirely sure, but this is why we used pickle.

I like the idea of a callback function. The save_outputs option is really more about quick-and-dirty debugging than it is a real feature at the moment.

Otherwise there's not a really good, general way of saving things. It'll vary hugely. Plus a callback would let us do things other than saving them.

thomas-a-neil mentioned this pull request Aug 8, 2019

Use h5py for output data writing and consolidation to reduce memory footprint songlab-cal/tape-neurips2019#10

Closed

DavidMChan changed the base branch from master to develop August 9, 2019 00:25

DavidMChan closed this Aug 9, 2019

DavidMChan reopened this Aug 9, 2019

thomas-a-neil force-pushed the save-outputs-h5py branch from 769a421 to 3de145c Compare August 9, 2019 20:40

Switch output writing to h5py to reduce memory footprint. Update version

b254a23

thomas-a-neil force-pushed the save-outputs-h5py branch from 3de145c to b254a23 Compare August 9, 2019 20:43

thomas-a-neil commented Aug 9, 2019

View reviewed changes

Add flag for save format

0927555

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch output writing to h5py to reduce memory footprint. Update version #12

Switch output writing to h5py to reduce memory footprint. Update version #12

thomas-a-neil commented Aug 8, 2019

DavidMChan commented Aug 9, 2019 •

edited

Loading

DavidMChan commented Aug 9, 2019

thomas-a-neil commented Aug 9, 2019

codecov-io commented Aug 9, 2019 •

edited

Loading

thomas-a-neil Aug 9, 2019

DavidMChan Aug 9, 2019

rmrao Aug 25, 2019

rmrao Aug 25, 2019

Switch output writing to h5py to reduce memory footprint. Update version #12

Are you sure you want to change the base?

Switch output writing to h5py to reduce memory footprint. Update version #12

Conversation

thomas-a-neil commented Aug 8, 2019

DavidMChan commented Aug 9, 2019 • edited Loading

DavidMChan commented Aug 9, 2019

thomas-a-neil commented Aug 9, 2019

codecov-io commented Aug 9, 2019 • edited Loading

Codecov Report

thomas-a-neil Aug 9, 2019

Choose a reason for hiding this comment

DavidMChan Aug 9, 2019

Choose a reason for hiding this comment

rmrao Aug 25, 2019

Choose a reason for hiding this comment

rmrao Aug 25, 2019

Choose a reason for hiding this comment

DavidMChan commented Aug 9, 2019 •

edited

Loading

codecov-io commented Aug 9, 2019 •

edited

Loading