v0.2.3
🚀 Streaming v0.2.3
Streaming v0.2.3 is released! Install via pip:
pip install --upgrade mosaicml-streaming==0.2.3
New Features
- Add scalar MDS encodings data types (#130)
- Support of WebVid-10M dataset (#132)
- Support of LAION-400M dataset (#87)
- Make
StreamingDataset[sample_id]
block to download the given sample's shard if it is not present, so that the dataset can be used lazily (#118) - Support of a Streaming benchmarking script to get time taken by the individual component (#121)
Bug Fixes
- Nuke concat option in C4 dataset (#129)
- Fixed bug report markdown doc (#140)
- Fixed ADE20K dataset conversion script (#133)
What's Changed
- Make getitem block to download shard if not present. by @knighton in #118
- 2022 -> 2023. by @knighton in #119
- Benchmark generating the epoch. by @knighton in #121
- Move datasets dependency into .[dev]. by @knighton in #123
- Bump sphinxcontrib-katex from 0.9.3 to 0.9.4 by @dependabot in #113
- Bump sphinxext-opengraph from 0.7.4 to 0.7.5 by @dependabot in #114
- Bump pytest from 7.2.0 to 7.2.1 by @dependabot in #124
- Bump fastapi from 0.88.0 to 0.89.1 by @dependabot in #125
- Bump yamllint from 1.28.0 to 1.29.0 by @dependabot in #126
- Update paramiko requirement from <3,>=2.11.0 to >=2.11.0,<4 by @dependabot in #127
- Bump nbsphinx from 0.8.11 to 0.8.12 by @dependabot in #128
- Nuke concat option. by @knighton in #129
- Add scalar MDS encodings (data types). by @knighton in #130
- WebVid. by @knighton in #132
- LAION-400M processing by @knighton in #87
- Update isort version by @karan6181 in #135
- Update pre-commit requirement from <3,>=2.18.1 to >=2.18.1,<4 by @dependabot in #134
- Fixed bug report markdown by @karan6181 in #140
- Fix ade20k conversion script by @dblalock in #133
- bump version to 0.2.3 by @karan6181 in #141
Full Changelog: v0.2.2...v0.2.3