- Updated README to include installation and usage instructions.
- Added standard open-source tooling, including package testing and requirements.
- Added `serialization` and `beam` modules to the API.
- Added utilities for creating test datasets.
- Added more examples, including a Dirichlet partitioning example.
- Removed `preprocess_fn` arg from `PartitionedDataset.build_group_stream`. Preprocessing should now be applied via `tf.data.Dataset.map`.
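
The last item above means preprocessing is no longer passed in as an argument but mapped onto a dataset. A minimal sketch of that pattern, assuming a small stand-in dataset in place of the real group data; the names `preprocess_fn`, `raw`, and `processed` are illustrative only, and how this attaches to the output of `PartitionedDataset.build_group_stream` depends on the element structure of your datasets:

```python
import tensorflow as tf


def preprocess_fn(example):
    # Hypothetical preprocessing step: scale integer values into [0, 1].
    return tf.cast(example, tf.float32) / 255.0


# Stand-in for a dataset of raw examples (an assumption for illustration;
# in practice this would come from your partitioned data).
raw = tf.data.Dataset.from_tensor_slices([0, 51, 255])

# Apply preprocessing with tf.data.Dataset.map rather than passing a
# preprocess_fn argument to the stream builder.
processed = raw.map(preprocess_fn)

values = [float(v) for v in processed.as_numpy_iterator()]
print(values)
```
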
0.2.1 - 2023-06-12
- Fixed a bug where Beam pipelines would not work with datasets that were not globally prepared.
0.2.0 - 2023-06-12
- Improved parallelism in Beam pipelines to speed up dataset grouping.
- Updated pyproject.toml file for initial PyPI release.
- Added more detailed README file.
0.1.0 - 2023-05-31
- Initial release