Lazy loading of zarr timestamps #3318

alejoe91 · 2024-08-20T13:25:33Z

This was previously discussed here: #2828

To sum up, loading and decompressing zarr timestamps for very long recordings can be quite time-consuming, so we want to avoid doing that at init. When fetching the timestamps, though, if they are not a numpy array they are cast to np.ndarray and cached, to avoid re-reading and re-decompressing at every call.
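As a rough illustration of the pattern described above (nothing is read at init, the first fetch materializes the timestamps, later fetches reuse the cached array), here is a minimal sketch. `LazyTimes` and `ToyRecording` are hypothetical stand-ins invented for this example; they are not SpikeInterface or zarr classes, and the real implementation may differ.

```python
import numpy as np


class LazyTimes:
    """Hypothetical stand-in for a compressed on-disk array (e.g. a zarr array)."""

    def __init__(self, num_samples):
        self.num_samples = num_samples
        self.reads = 0  # counts how often the data is materialized

    def __array__(self, dtype=None, copy=None):
        # Pretend this is the expensive read + decompress step.
        self.reads += 1
        return np.arange(self.num_samples, dtype="float64")


class ToyRecording:
    """Hypothetical recording that holds a possibly lazy time vector."""

    def __init__(self, time_vector):
        # Nothing is loaded here: the lazy object is stored as-is.
        self.time_vector = time_vector

    def get_times(self) -> np.ndarray:
        # Materialize on first access and cache the in-memory array.
        self.time_vector = np.asarray(self.time_vector)
        return self.time_vector


lazy = LazyTimes(5)
recording = ToyRecording(lazy)
reads_at_init = lazy.reads

first = recording.get_times()
reads_after_first = lazy.reads

second = recording.get_times()
reads_after_second = lazy.reads
```

In this sketch the read counter stays at zero after construction and does not grow on the second call, because the second call receives the cached `np.ndarray`.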

h-mayorquin

Yes, I think it is better. Zarr is not the format where we want the data to be read when called.

Can you add the typing to get_times() -> np.ndarray? This will make it clear that it is returning an in-memory object. We should also add a docstring describing this behavior.

h-mayorquin · 2024-08-20T15:55:51Z

src/spikeinterface/core/baserecording.py

-                return self.time_vector
-            else:
-                return np.array(self.time_vector)
+            if not isinstance(self.time_vector, np.ndarray):


I think you can just always call np.asarray(), which by default will just pass the data along if it is already an np.ndarray but will create a copy if it is hdf5, zarr or a memmap.

thanks! Great suggestion!

Just to clarify, you suggest doing this?

    def get_times(self) -> np.ndarray:
        if self.time_vector is not None:
            self.time_vector = np.asarray(self.time_vector)
            return self.time_vector
        else:
            time_vector = np.arange(self.get_num_samples(), dtype="float64")
            time_vector /= self.sampling_frequency
            if self.t_start is not None:
                time_vector += self.t_start
            return time_vector

Then done in last commit :)
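The difference between the original `np.array(...)` call and the suggested `np.asarray(...)` can be checked with plain NumPy. This is only a small sketch using in-memory inputs; the variable names are made up for the example.

```python
import numpy as np

in_memory = np.arange(4, dtype="float64")

# np.asarray hands back the very same object when the input is already
# a plain np.ndarray, so no data is copied.
passed_along = np.asarray(in_memory)

# np.array copies by default, even when the input is already an ndarray.
copied = np.array(in_memory)

# Anything that is not an ndarray (here, a list) is converted to one.
converted = np.asarray([0.0, 1.0, 2.0])

print(passed_along is in_memory)
print(np.shares_memory(copied, in_memory))
print(type(converted).__name__)
```

This is why calling `np.asarray` unconditionally is safe for a time vector that is already cached in memory: repeated calls do not duplicate it.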

Lazy loading of zarr timestamps

c123939

alejoe91 added the core Changes to core module label Aug 20, 2024

alejoe91 requested a review from h-mayorquin August 20, 2024 13:25

h-mayorquin reviewed Aug 20, 2024

alejoe91 added 2 commits August 20, 2024 18:51

asarray and annotations

522260c

Merge branch 'main' into lazy-load-zarr-times

2af1f46

h-mayorquin approved these changes Aug 20, 2024

alejoe91 merged commit a2f157c into SpikeInterface:main Aug 21, 2024
15 checks passed
