Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parsing of annotation / query by condition #7

Open
julien-roux opened this issue Dec 14, 2016 · 4 comments
Open

Parsing of annotation / query by condition #7

julien-roux opened this issue Dec 14, 2016 · 4 comments

Comments

@julien-roux
Copy link
Contributor

julien-roux commented Dec 14, 2016

(Issue originally opened at wirawara/BgeeDB#2)

It would be nice to have a function allowing to easily parse the Affymetrix and RNA-seq annotations data frames. For example it might be super useful to:

  • query tissues and stages IDs using free-text strings
  • for a given stage or organ ID, retrieve all parent/children stages/organs. This could be done with a webservice query very similar to what is done in loadTopAnatData.R to get the organ.relationships list.
  • retrieve all samples annotated to a given organ / stage, including or not its sub-structures
@julien-roux
Copy link
Contributor Author

julien-roux commented Dec 15, 2016

See https://f1000research.com/articles/5-2748/v1#referee-response-18221, who also thinks this would be a good idea.

We could easily create:

  • a getCallsByCondition function that would return all expression calls for the queried condition. This function would be very similar to loadTopAnatData.R, just outputing the gene2anatomy list in reverse order (anatomy2gene list)

  • a getDataByCondition function that would return all processed data for chips / libraries (depending on the dataType of the input Bgee object) matching a queried condition: organ, stage, (optionally sex), (optionally strain). This function could easily use of a webservice query giving the list of all chip / libraries associated to this condition.
    What is a bit more tricky: to get access to the chips/libraries, the whole experiments .zip files would have to be downloaded, extracted, to use only one or a few files inside... Not very efficient. Another option would be to query the processed data trough the webservice: FTP file transfer success / processed data query through webservice #9

Something for the new postdoc to start with :p ? @marcrr @fbastian

@julien-roux julien-roux changed the title Parsing of annotation Parsing of annotation / query by condition Dec 15, 2016
@fbastian
Copy link
Member

Should it also include sub-conditions? (e.g., expression in substructures of a queried organ). Or shold we provide relations between conditions in a different method?

@marcrr
Copy link

marcrr commented Jan 24, 2017

If it's easy from the R side, then yes I think that it's a good idea to include sub-conditions.

@julien-roux
Copy link
Contributor Author

Ideally, the function could have an argument to specify this, and by default, yes I think it's a good idea to include sub-conditions

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants