Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for reading Parquet files #576

Open
fb64 opened this issue Jan 28, 2024 · 4 comments · May be fixed by #577
Open

Add support for reading Parquet files #576

fb64 opened this issue Jan 28, 2024 · 4 comments · May be fixed by #577
Labels
enhancement New feature or request
Milestone

Comments

@fb64
Copy link
Contributor

fb64 commented Jan 28, 2024

No description provided.

@zaleslaw
Copy link
Collaborator

Thanks for the issue, could you please provide a use-case for this task? Do you have data in this format to explore?

@fb64
Copy link
Contributor Author

fb64 commented Jan 29, 2024

IMHO Parquet format is now a standard in term of data exchange format as it provides both good performance and good compression rate. Lot of data can be found in this format and it is supported by most data science tools.
In my PR you can find a parquet file example.
Some open data could also be found as parquet file like Newyork Yellow taxi trip for example : https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page

@Jolanrensen
Copy link
Collaborator

Can confirm, Parquet support would be nice, it's used a lot with Spark as well.

@Jolanrensen Jolanrensen added the enhancement New feature or request label Jan 29, 2024
@kopczynski-9livesdata
Copy link

I second this. Parquet format is massively popular when working with Spark and having this in Kotlin Dataframe would be great.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants