Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add and/or compute functions #481
Add and/or compute functions #481
Changes from 12 commits
47535d4
116a5c8
203dcf1
952d883
acef812
d988a02
7f5134b
17ac296
6257749
2d4fe51
125e54e
c3b8a83
c91361f
38d66fc
18ab244
8cd4414
976139c
381aa3c
d29521d
0decc3d
038eb60
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this maybe dispatch on the right hand side instead of converting to arrow?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
definitely possible, but I think we might be missing some larger abstraction here to solve the whole left/right issue
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe first check if either scalar is_null, and then I think the more canonical conversion would be
bool::try_from(array.scalar())?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should chat through the scalar API next week, it's a bit weird, you might have some ideas to improve
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Either this fallback will fail (into_bool when it's not?), or into_bool does an expensive expansion of the constant.
I think the best thing is to have the convention that constant goes on the RHS, so we should fallback to
and(other, constant_array)
to allow the RHS encoding a chance to be efficient, e.g. if it's run-end it only needs to run over the values.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can't actually fallback intoFound a solution I'm happy with here.AndFn
here because this might also be anor
, I'm trying to think of a better way of generalizing boolean ops here (I do agree with trying to letrhs
to give us a more efficient implementation)There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This isn't zero copy is it? We can add a function to PrimitiveArray to construct directly from a buffer (for PType==u8?)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Alternatively, @robert3005 did we discuss VarBin data being a buffer instead of a child array?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we thought about it but then we wanted to have something like ZstdEncoding
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I’m starting to think general purpose compression can be configured on buffers at write-time though; using the layouts mechanism.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not zero-copy, but still pretty cheap IMO. Constructing things from
Bytes
is hard because there's no guarantee the instance is exclusiveThere was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
But PrimitiveArray wraps a vortex-buffer, which itself wraps Bytes. So this copy is purely because the right API isn't exposed / isn't used
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
true, fixing. Added a
from_bytes
function.