This repository has been archived by the owner on Oct 12, 2020. It is now read-only.

Picking a dataset #3

Open
BrunoGrandePhD opened this issue Oct 20, 2017 · 5 comments

@BrunoGrandePhD

I envisioned that we would pick a genomic dataset (or a set of related datasets) that we can all use so that the tutorials are consistent with one another. It then makes it easier to string the tutorials together as part of a longer workshop.

Who has ideas of good datasets that can be analyzed in different ways for each topic we end up selecting?

@privefl
Contributor

privefl commented Oct 20, 2017

I think 1000 Genomes is super standard.

@zhenyisong
Contributor

It depends on our storyline. What are we analyzing the data for?

@BrunoGrandePhD
Author

BrunoGrandePhD commented Oct 20, 2017

@privefl: I agree that 1000G is pretty standard. Another option is Genome in a Bottle, but that's a single sample and doesn't allow for any cohort studies.

@zhenyisong: I think it would be too hard to maintain a perfectly uniform storyline across all tutorials. I think the best we can do is pick a uniform dataset (e.g. 1000G data) and analyze it in different ways to cover the various topics we want to develop tutorials for.

@privefl
Contributor

privefl commented Oct 20, 2017

What I just discovered: http://googlegenomics.readthedocs.io/en/latest/

@privefl
Contributor

privefl commented Oct 20, 2017

1000 Cannabis Genomes Project ahah


3 participants