You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The file should have 3 numbers with accurate rowgroup metadata.
Actual behaviour
This writes a file with 3 numbers but corrupt metadata - the max_value of the row group is the 128 value instead of the 129 value.
➜ parquet meta corrupt.parquet [12:39:23]
File path: corrupt.parquet
Created by: @dsnp/parquetjs
Properties: (none)
Schema:
message root {
required binary num (DECIMAL(20,0));
}
Row group 0: count: 3 37.00 B records start: 4 total(compressed): 111 B total(uncompressed):186 B
--------------------------------------------------------------------------------
type encodings count avg size nulls min / max
num BINARY _ BB_ 3 37.00 B 0 "1000000000000127" / "1000000000000128"
➜ parquet check-stats corrupt.parquet [12:39:30]
corrupt.parquet has corrupt stats: Max should be >= all values.
Any logs, error output, etc?
n/a
Any other comments?
I think this is because the library uses < and > instead of Buffer.compare() to determine the statistics min_value/max_values.
The text was updated successfully, but these errors were encountered:
Steps to reproduce
This program reproduces it on 1.7.0:
Expected behaviour
The file should have 3 numbers with accurate rowgroup metadata.
Actual behaviour
This writes a file with 3 numbers but corrupt metadata - the max_value of the row group is the 128 value instead of the 129 value.
Any logs, error output, etc?
n/a
Any other comments?
I think this is because the library uses
<
and>
instead ofBuffer.compare()
to determine the statistics min_value/max_values.The text was updated successfully, but these errors were encountered: