Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter scan data frame #8

Open
wants to merge 5 commits into
base: dev
Choose a base branch
from

Conversation

l0s
Copy link

@l0s l0s commented May 22, 2014

This provides a function that scans an HBase table given a filter string and returns a data frame. It also refactors the common logic shared between hb.scan.data.frame and hb.scan.ex.data.frame.

Discussion: https://groups.google.com/d/msg/rhadoop/Oty-A0UodTM/ngbLVAFRX0oJ .

l0s added 5 commits May 21, 2014 20:58
The row names of the resulting data frame are the keys of the HBase
table. The column names are those specified in colspec.  If a row does
not have a value for a column specified in colspec, then that cell will
be populated with NA.

Prep work to allow all columns in a column family.

Use cols instead of colspec when getting column values.

modularise helper methods
@piccolbo
Copy link
Collaborator

This has all the commits of #7 plus one. Isn't it possible to base this on that? I need to think a moment about what exactly that means in git. Initially I thought that filter scan was an independent feature. It seems like it should, right? Whether rows are returned as a data frame or list, applying a filter should make sense.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants