Read and Write Apache Parquet #6699

simonaubertbd · 2024-01-09T08:25:06Z

What's your use case?
Apache Parquet ( https://parquet.apache.org/ ) becomes more and more popular and I think it's like a standard now in the data community, this is no more restricted to Hadoop People. Qlik supports it, Alteryx will support it in the next release, even LibreOffice is working on it, etc, etc.
Why?
-opensource format
-fast

What's your proposed solution?
To have Orange Data Mining support Apache Parquet files for read and write.

Are there any alternative solutions?
To convert parquet files before but seems useless

markotoplak · 2024-01-09T08:43:02Z

Makes sense indeed. Orange lacks a robust and fast file format.

When I need fast reading, I resort to picked tables, but a robust format like that would be a big improvement.

simonaubertbd · 2024-10-18T09:50:58Z

Hello @markotoplak do you plan to add it in a future release ? Another point is that it would help the corporate and the research worlds communicate each other.

Best regards,

Simon

markotoplak · 2024-10-18T12:27:52Z

@simonaubertbd, first one on our list is HDF5 support, then we can also consider Parquet.

But if anyone does Parque we'll gladly merge it.

simonaubertbd · 2024-10-20T15:25:59Z

@zhuyubei Whoa, pretty impressive, congrats. Is there a pull request for that ? O_o

zhuyubei · 2024-10-20T23:40:59Z

@zhuyubei Whoa, pretty impressive, congrats. Is there a pull request for that ? O_o

But my code redesigned the whole "Table" class to store data as dataframe instead of numpy. It might not work with the current version.
Let me try to figure out how to implement a parquet reader in current Orange3 version.

markotoplak added the wish label Jan 9, 2024

janezd added the meal This will take a day or two label Jan 12, 2024

simonaubertbd mentioned this issue Oct 22, 2024

Multi-Tab User Interface #6922

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Read and Write Apache Parquet #6699

Read and Write Apache Parquet #6699

simonaubertbd commented Jan 9, 2024

markotoplak commented Jan 9, 2024 •

edited

Loading

simonaubertbd commented Oct 18, 2024

markotoplak commented Oct 18, 2024

simonaubertbd commented Oct 20, 2024

zhuyubei commented Oct 20, 2024

Read and Write Apache Parquet #6699

Read and Write Apache Parquet #6699

Comments

simonaubertbd commented Jan 9, 2024

markotoplak commented Jan 9, 2024 • edited Loading

simonaubertbd commented Oct 18, 2024

markotoplak commented Oct 18, 2024

simonaubertbd commented Oct 20, 2024

zhuyubei commented Oct 20, 2024

markotoplak commented Jan 9, 2024 •

edited

Loading