Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
chore(deps): update arrow to v52.2.0 (#523)
[![Mend Renovate](https://app.renovatebot.com/images/banner.svg)](https://renovatebot.com) This PR contains the following updates: | Package | Type | Update | Change | |---|---|---|---| | [arrow](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-arith](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-array](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-buffer](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-cast](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-csv](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-data](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-ipc](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-ord](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-schema](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [arrow-select](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | | [parquet](https://togithub.com/apache/arrow-rs) | workspace.dependencies | minor | `52.1.0` -> `52.2.0` | --- ### Release Notes <details> <summary>apache/arrow-rs (arrow)</summary> ### [`v52.2.0`](https://togithub.com/apache/arrow-rs/blob/HEAD/CHANGELOG.md#5220-2024-07-24) [Compare Source](https://togithub.com/apache/arrow-rs/compare/52.1.0...52.2.0) [Full Changelog](https://togithub.com/apache/arrow-rs/compare/52.1.0...52.2.0) **Implemented enhancements:** - Faster min/max for string/binary view arrays [#​6088](https://togithub.com/apache/arrow-rs/issues/6088) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Support casting to/from Utf8View [#​6076](https://togithub.com/apache/arrow-rs/issues/6076) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Min/max support for String/BinaryViewArray [#​6052](https://togithub.com/apache/arrow-rs/issues/6052) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Improve performance of constructing `ByteView`s for small strings [#​6034](https://togithub.com/apache/arrow-rs/issues/6034) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Fast UTF-8 validation when reading StringViewArray from Parquet [#​5995](https://togithub.com/apache/arrow-rs/issues/5995) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] - Optimize StringView row decoding [#​5945](https://togithub.com/apache/arrow-rs/issues/5945) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Implementing `deduplicate` / `intern` functionality for StringView [#​5910](https://togithub.com/apache/arrow-rs/issues/5910) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Add `FlightSqlServiceClient::new_from_inner` [#​6003](https://togithub.com/apache/arrow-rs/pull/6003) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Complete `StringViewArray` and `BinaryViewArray` parquet decoder: [#​6004](https://togithub.com/apache/arrow-rs/pull/6004) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add begin/end_transaction methods in FlightSqlServiceClient [#​6026](https://togithub.com/apache/arrow-rs/pull/6026) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Read Parquet statistics as arrow `Arrays` [#​6046](https://togithub.com/apache/arrow-rs/pull/6046) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([efredine](https://togithub.com/efredine)) **Fixed bugs:** - Panic in `ParquetMetadata::memory_size` if no min/max set [#​6091](https://togithub.com/apache/arrow-rs/issues/6091) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] - BinaryViewArray doesn't roundtrip a single `Some(&[])` through parquet [#​6086](https://togithub.com/apache/arrow-rs/issues/6086) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] - Parquet `ColumnIndex` for null columns is written even when statistics are disabled [#​6010](https://togithub.com/apache/arrow-rs/issues/6010) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] **Documentation updates:** - Fix typo in GenericByteViewArray documentation [#​6054](https://togithub.com/apache/arrow-rs/pull/6054) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([progval](https://togithub.com/progval)) - Minor: Improve parquet PageIndex documentation [#​6042](https://togithub.com/apache/arrow-rs/pull/6042) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([alamb](https://togithub.com/alamb)) **Closed issues:** - Potential performance improvements for reading Parquet to StringViewArray/BinaryViewArray [#​5904](https://togithub.com/apache/arrow-rs/issues/5904) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] **Merged pull requests:** - Faster `GenericByteView` construction [#​6102](https://togithub.com/apache/arrow-rs/pull/6102) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add benchmark to track byte-view construction performance [#​6101](https://togithub.com/apache/arrow-rs/pull/6101) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Optimize `bool_or` using `max_boolean` [#​6100](https://togithub.com/apache/arrow-rs/pull/6100) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([simonvandel](https://togithub.com/simonvandel)) - Optimize `max_boolean` by operating on u64 chunks [#​6098](https://togithub.com/apache/arrow-rs/pull/6098) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([simonvandel](https://togithub.com/simonvandel)) - fix panic in `ParquetMetadata::memory_size`: check has_min_max_set before invoking min()/max() [#​6092](https://togithub.com/apache/arrow-rs/pull/6092) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([Fischer0522](https://togithub.com/Fischer0522)) - Implement specialized min/max for `GenericBinaryView` (`StringView` and `BinaryView`) [#​6089](https://togithub.com/apache/arrow-rs/pull/6089) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add PartialEq to ParquetMetaData and FileMetadata [#​6082](https://togithub.com/apache/arrow-rs/pull/6082) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([adriangb](https://togithub.com/adriangb)) - Enable casting from Utf8View [#​6077](https://togithub.com/apache/arrow-rs/pull/6077) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([a10y](https://togithub.com/a10y)) - StringView support in arrow-csv [#​6062](https://togithub.com/apache/arrow-rs/pull/6062) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([2010YOUY01](https://togithub.com/2010YOUY01)) - Implement min max support for string/binary view types [#​6053](https://togithub.com/apache/arrow-rs/pull/6053) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Minor: clarify the relationship between `file::metadata` and `format` in docs [#​6049](https://togithub.com/apache/arrow-rs/pull/6049) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([alamb](https://togithub.com/alamb)) - Minor API adjustments for StringViewBuilder [#​6047](https://togithub.com/apache/arrow-rs/pull/6047) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add parquet `StatisticsConverter` for arrow reader [#​6046](https://togithub.com/apache/arrow-rs/pull/6046) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([efredine](https://togithub.com/efredine)) - Directly decode String/BinaryView types from arrow-row format [#​6044](https://togithub.com/apache/arrow-rs/pull/6044) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Clean up unused code for view types in offset buffer [#​6040](https://togithub.com/apache/arrow-rs/pull/6040) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Avoid using Buffer api that accidentally copies data [#​6039](https://togithub.com/apache/arrow-rs/pull/6039) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - MINOR: Fix `hashbrown` version in `arrow-array`, remove from `arrow-row` [#​6035](https://togithub.com/apache/arrow-rs/pull/6035) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([mbrobbel](https://togithub.com/mbrobbel)) - Improve performance reading `ByteViewArray` from parquet by removing an implicit copy [#​6031](https://togithub.com/apache/arrow-rs/pull/6031) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add begin/end_transaction methods in FlightSqlServiceClient [#​6026](https://togithub.com/apache/arrow-rs/pull/6026) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Unsafe improvements: core `parquet` crate. [#​6024](https://togithub.com/apache/arrow-rs/pull/6024) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([veluca93](https://togithub.com/veluca93)) - Additional tests for parquet reader utf8 validation [#​6023](https://togithub.com/apache/arrow-rs/pull/6023) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([alamb](https://togithub.com/alamb)) - Update zstd-sys requirement from >=2.0.0, <2.0.12 to >=2.0.0, <2.0.13 [#​6019](https://togithub.com/apache/arrow-rs/pull/6019) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([dependabot\[bot\]](https://togithub.com/apps/dependabot)) - fix doc ci in latest rust nightly version [#​6012](https://togithub.com/apache/arrow-rs/pull/6012) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([Rachelint](https://togithub.com/Rachelint)) - Do not write `ColumnIndex` for null columns when not writing page statistics [#​6011](https://togithub.com/apache/arrow-rs/pull/6011) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([etseidl](https://togithub.com/etseidl)) - Fast utf8 validation when loading string view from parquet [#​6009](https://togithub.com/apache/arrow-rs/pull/6009) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Deduplicate strings/binarys when building view types [#​6005](https://togithub.com/apache/arrow-rs/pull/6005) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Complete `StringViewArray` and `BinaryViewArray` parquet decoder: implement delta byte array and delta length byte array encoding [#​6004](https://togithub.com/apache/arrow-rs/pull/6004) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add `FlightSqlServiceClient::new_from_inner` [#​6003](https://togithub.com/apache/arrow-rs/pull/6003) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Rename `Schema::all_fields` to `flattened_fields` [#​6001](https://togithub.com/apache/arrow-rs/pull/6001) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Refine documentation and examples for `DataType` [#​5997](https://togithub.com/apache/arrow-rs/pull/5997) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([alamb](https://togithub.com/alamb)) - implement `DataType::try_form(&str)` [#​5994](https://togithub.com/apache/arrow-rs/pull/5994) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([samuelcolvin](https://togithub.com/samuelcolvin)) - Implement dictionary support for reading ByteView from parquet [#​5973](https://togithub.com/apache/arrow-rs/pull/5973) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) \* *This Changelog was automatically generated by [github_changelog_generator](https://togithub.com/github-changelog-generator/github-changelog-generator)* </details> --- ### Configuration 📅 **Schedule**: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined). 🚦 **Automerge**: Enabled. ♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about these updates again. --- - [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check this box --- This PR was generated by [Mend Renovate](https://www.mend.io/free-developer-tools/renovate/). View the [repository job log](https://developer.mend.io/github/spiraldb/vortex). <!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy40MzguMCIsInVwZGF0ZWRJblZlciI6IjM3LjQzOC4wIiwidGFyZ2V0QnJhbmNoIjoiZGV2ZWxvcCIsImxhYmVscyI6W119--> Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
- Loading branch information