Configurable compression #66

clbarnes · 2019-07-16T18:03:45Z

As mentioned in #65.

Getting arbitrary compression options across the pyo3 interface seems like it could be a pain. Also, choosing an API may require a bit of thought:

The h5py way

Takes a compression: Optional[str]=None, and a compression_opts: Optional[Any]=None. For level-5 gzip, these would be "gzip" and 5. h5py doesn't support that many compression types, but n5 may want to support things with fairly complicated configurations, as the config is just stored as a JSON object. Zarr supports compression layers. Splitting the configuration across multiple arguments is a bit unfortunate.
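For illustration, a minimal sketch of how the two h5py-style arguments could be folded into a single N5-style JSON object. The function name h5py_style_to_n5 and the default level are hypothetical, and only "gzip" and uncompressed ("raw") are handled; this is not the library's actual code.

```python
from typing import Any, Dict, Optional

# Arbitrary default chosen for this sketch, not taken from any library.
DEFAULT_GZIP_LEVEL = 5


def h5py_style_to_n5(
    compression: Optional[str] = None,
    compression_opts: Optional[Any] = None,
) -> Dict[str, Any]:
    """Fold h5py-style (compression, compression_opts) into one JSON-like dict."""
    if compression is None:
        if compression_opts is not None:
            raise ValueError("compression_opts given without compression")
        return {"type": "raw"}
    if compression == "gzip":
        level = DEFAULT_GZIP_LEVEL if compression_opts is None else compression_opts
        if not isinstance(level, int):
            raise TypeError("gzip expects an integer level")
        return {"type": "gzip", "level": level}
    # Anything more complicated would need its own branch here,
    # which is where the two-argument API starts to get awkward.
    raise ValueError(f"unsupported compression: {compression!r}")


config = h5py_style_to_n5("gzip", 5)
```

Here compression_opts has to be Any because its meaning depends on the value of compression, which is the drawback noted above.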

The z5 way

**compression_opts are kwargs. This is nice because it means the type of each option can be consistent, but it prevents you from adding any other **kwargs in the future.
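A rough sketch of the kwargs style and its drawback. The function create_dataset_sketch is hypothetical and only builds the compression object; it is not z5's real signature.

```python
from typing import Any, Dict, Optional, Tuple


def create_dataset_sketch(
    name: str,
    shape: Tuple[int, ...],
    compression: Optional[str] = None,
    **compression_opts: Any,
) -> Dict[str, Any]:
    """Return the compression config that would be stored for the dataset."""
    config: Dict[str, Any] = {"type": compression or "raw"}
    # Every extra keyword is treated as a compression option.
    config.update(compression_opts)
    return config


ok = create_dataset_sketch("data", (10,), compression="gzip", level=5)

# The downside: an unrelated keyword is silently swallowed into the
# compression config, because **kwargs is already spoken for.
swallowed = create_dataset_sketch(
    "data", (10,), compression="gzip", level=5, chunks=(5,)
)
```

In the second call, chunks ends up inside the compression object instead of raising an error or being handled as a dataset option.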

The strongly-typed way

As Rust (and presumably Java) does: take a single object whose type describes the type of compression, and whose members configure that compression. This would be discoverable and hopefully easy to map to Rust (namedtuple -> struct). But it means duplicating some of the compression-configuration machinery from Rust into Python.
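One way this could look on the Python side, as a sketch only. The class names Raw and Gzip, the type_name attribute and the to_json_obj helper are all hypothetical; frozen dataclasses are used here, though namedtuples would work similarly.

```python
from dataclasses import asdict, dataclass
from typing import Any, ClassVar, Dict, Union


@dataclass(frozen=True)
class Raw:
    """No compression; has no options."""

    type_name: ClassVar[str] = "raw"


@dataclass(frozen=True)
class Gzip:
    """Gzip compression, configured by its level."""

    level: int = 5  # arbitrary default for this sketch
    type_name: ClassVar[str] = "gzip"


Compression = Union[Raw, Gzip]


def to_json_obj(compression: Compression) -> Dict[str, Any]:
    """Build the JSON-like object: the type tag plus the instance's fields."""
    # ClassVar attributes are not dataclass fields, so asdict() skips them.
    return {"type": compression.type_name, **asdict(compression)}


gzip_config = to_json_obj(Gzip(level=5))
raw_config = to_json_obj(Raw())
```

Each new compression type needs its own class here, which is the duplication of machinery mentioned above.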

The sketchy MVP way

Whenever you create a dataset (writing its attributes.json), python then re-opens the attributes file and dumps an arbitrary compression JSON object in there, hopefully before any data is written.
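A minimal sketch of that re-open-and-overwrite step, assuming the dataset directory already contains an attributes.json and that the compression object lives under a top-level "compression" key. The helper name overwrite_compression is hypothetical.

```python
import json
from pathlib import Path
from typing import Any, Dict, Union


def overwrite_compression(
    dataset_dir: Union[str, Path], compression: Dict[str, Any]
) -> None:
    """Replace the compression entry in an existing attributes.json."""
    attrs_path = Path(dataset_dir) / "attributes.json"
    attrs = json.loads(attrs_path.read_text())
    # Assumption: compression config sits under the "compression" key.
    attrs["compression"] = compression
    attrs_path.write_text(json.dumps(attrs))
```

This is only safe if it runs before any block is written, since existing blocks would otherwise have been encoded with the old compression.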


aschampion · 2019-07-16T18:15:04Z

rust-n5's compression is serde deserializable, so a sketchy MVP way already exists: just pass the JSON through ({"type": "gzip", "level": 5}).

clbarnes · 2019-09-18T16:14:27Z

Released the h5py way in v1.0.0 (implemented using the sketchy MVP way)

clbarnes closed this as completed Sep 18, 2019
