-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What order should results be in? #62
Comments
|
Definitely default to genomic location (chr+start), option to use another column? |
Some of these tables don't have loci information. E.g. ibis.common.exceptions.IbisTypeError: Column 'chrom' is not found in table. Existing columns: 'gene_id', 'gene_name', 'gene_biotype', 'gene_seq_start', 'gene_seq_end', 'seq_name', 'seq_strand', 'seq_coord_system', 'description', 'gene_id_version', 'canonical_transcript'. |
@thomas-reimonn Ah, I think @emdann, for something like And then, do we consider strand for sorting exons within a transcript? |
It looks like |
I also agree by default it should be chrom + start. It seems like this is also what GenomicFeatures does and they have a separate function that returns ranges sorted by another value I think: e.g.
|
I think the bioc packages go by chrom + start for the "primary" thing being queried (e.g. genes for DemoSetuplibrary(ensembldb)
library(EnsDb.Hsapiens.v86)
edb = EnsDb.Hsapiens.v86
library(GenomicFeatures)
samplefile <- system.file("extdata", "hg19_knownGene_sample.sqlite",
package="GenomicFeatures")
txdb <- loadDb(samplefile)
|
Description of feature
Continuing from #59 (comment)
In the mentioned PR, we found out that duckdb is inconsistent with what order it returns results in. This by and large seems fine from a duckdb point of view, but would be frustrating for users of this package.
To solve this @thomas-reimonn added an
order_by
statement hereRight now the order returned is based on the order of columns in the user input (I believe). Is there a better/ more canonical way to do this? Off the top of my head I would assume we'd generally want to sort by
chrom
andstart
.We should also check what the bioc packages do here.
@nvictus, @emdann, @lauradmartens any thoughts?
The text was updated successfully, but these errors were encountered: