Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

File-based table provider for Datafusion #546

Merged
merged 25 commits into from
Aug 6, 2024

Conversation

AdamGS
Copy link
Contributor

@AdamGS AdamGS commented Aug 5, 2024

This PR is a very minimal TableProvider to read vortex-encoded files.

@AdamGS AdamGS marked this pull request as draft August 5, 2024 14:04
@AdamGS AdamGS requested review from a10y and robert3005 August 5, 2024 16:38
@AdamGS AdamGS marked this pull request as ready for review August 5, 2024 16:39
@AdamGS AdamGS changed the title [WIP] file table provider for Datafusion File-based table provider for Datafusion Aug 5, 2024
builder = builder.with_projection(Projection::new(projection))
}

DFResult::Ok(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should figure out fmt/clippy rule for this

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's a rule, however, it's very fiddly to configure. Will have a look at some point if there's a simple setup

@robert3005
Copy link
Member

I wonder how datafusion handles massive file listing. I have a feeling it might just oom

@AdamGS AdamGS enabled auto-merge (squash) August 6, 2024 09:54
@AdamGS AdamGS merged commit 5a8c2d8 into develop Aug 6, 2024
3 checks passed
@AdamGS AdamGS deleted the adamg/vortex-datafusion-disk-provider branch August 6, 2024 09:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants