Releases: iterative/datachain
Releases · iterative/datachain
0.6.4
0.6.3
0.6.2
What's Changed
- Refactor the configuration parsing by @amritghimire in #513
- Add studio auth to datachain by @amritghimire in #514
- patch unstructured embeddings gen example by @mattseddon in #520
- Adding FSSpec Export for CSV and Parquet by @dtulga in #516
- Window functions by @dreadatour in #515
- Refactor file system source and path by @ilongin in #496
- Update numpy requirement from <2,>=1 to >=1,<3 by @dependabot in #498
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #528
- Bump mypy from 1.12.0 to 1.12.1 by @dependabot in #529
- Update pillow requirement from <11,>=10.0.0 to >=10.0.0,<12 by @dependabot in #531
- Add option to set team in config for studio by @amritghimire in #527
- Implement ls-datasets for studio client by @amritghimire in #526
- Adding JSON / JSON Lines Export Support by @dtulga in #538
- datachain: support mutating existing column by @skshetry in #537
- Fix CLI to work with
DataChain
new listing by @ilongin in #517
Full Changelog: 0.6.1...0.6.2
0.6.1
What's Changed
- drop DatasetRow outdated class by @shcheklein in #505
- Bump mypy from 1.11.2 to 1.12.0 by @dependabot in #512
- Add a way to persist dataset even if exception is thrown by @amritghimire in #504
- remove ordering by sys.id by @mattseddon in #507
- Implement chain group_by by @dreadatour in #482
Full Changelog: 0.6.0...0.6.1
0.6.0
What's Changed
- Add string replace function by @shcheklein in #508
- Add column types to from_csv to override auto inference by @shcheklein in #506
Full Changelog: 0.5.1...0.6.0
0.5.1
What's Changed
- fix: print each statement on a separate line when on debug mode by @skshetry in #479
- Merge datachain.query.udf into datachain.lib.udf and clean-up by @rlamy in #483
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #485
- Parquet Import+Export with SignalSchema by @dtulga in #480
- use returning in get_next_ids by @mattseddon in #484
- Adding Streaming CSV Export by @dtulga in #488
- Remove code duplication: UDFBase._parse_grouped_rows() by @rlamy in #490
- Don't allow mappers to skip rows by @rlamy in #491
- Make datachain queries atomic when exception occurs by @amritghimire in #494
Full Changelog: 0.5.0...0.5.1
0.5.0
What's Changed
- Split DatasetQuery from DataChain by @rlamy in #459
- remove legacy shadow attribute by @mattseddon in #478
Full Changelog: 0.4.0...0.5.0
0.4.0
0.3.20
0.3.19
What's Changed
- Reintroduce and update test_udf_after_limit() by @rlamy in #458
IndexedFile
->ArrowRow
by @dberenbaum in #445- assert each example has some output instead of stdout and stderr by @mattseddon in #468
- query: remove compat for executing last query expression by @skshetry in #449
- Introduce DatasetVersionNotFoundError in errors by @amritghimire in #461
- use official github action for uv and
uv build
by @skshetry in #470
Full Changelog: 0.3.18...0.3.19