Using v6d to share df across 2 independent process with 0-copy and without deserialization #1591

Answered by sighingnow
meta-ks asked this question in Q&A
I have tried to reproduce such dataframes. As you can see, putting/getting arrow tables is much faster than the same operations on pandas dataframes, and the to_pandas/from_pandas conversion is time-consuming.

testing put/get pandas dataframe:
Time elapsed for <function put_vineyard at 0x7fe1d29e6b00>: 5.225 s
Time elapsed for <function get_vineyard at 0x7fe1d29e6c20>: 4.678 s
testing put/get arrow table:
Time elapsed for <function to_arrow_table at 0x7fe1d29c0d30>: 3.333 s
Time elapsed for <function put_vineyard at 0x7fe1d29e6b00>: 0.232 s
Time elapsed for <function get_vineyard at 0x7fe1d29e6c20>: 0.108 s
Time elapsed for <function to_pandas_dataframe at 0x7fe1d29c1480>: 4.644 s
#!/usr/bin/env python3

import f…
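
The "Time elapsed for <function ...>" lines above look like the output of a timing decorator. The script is truncated here, so what follows is only a minimal sketch of how such output could be produced, not the original code. The names `timed` and `build_rows` are hypothetical, and `build_rows` is a stand-in workload that does not touch vineyard, pandas, or arrow.

```python
import functools
import time


def timed(func):
    """Print the wall-clock time of each call, in the style of the log above."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        finally:
            elapsed = time.perf_counter() - start
            # Formatting the function object itself yields "<function name at 0x...>".
            print(f"Time elapsed for {func}: {elapsed:.3f} s")

    return wrapper


@timed
def build_rows(n):
    # Hypothetical stand-in workload; replace with the put/get call being measured.
    return [(i, str(i)) for i in range(n)]


rows = build_rows(100_000)
```

Wrapping each step (conversion, put, get) in its own decorated function gives one timing line per step, which is what makes the conversion cost visible separately from the put/get cost.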

Answer selected by sighingnow