Picking a dataset #3

BrunoGrandePhD · 2017-10-20T16:48:24Z

I envisioned that we would pick a genomic dataset (or a set of related datasets) that we can all use so that the tutorials are consistent with one another. It then makes it easier to string the tutorials together as part of a longer workshop.

Who has ideas of good datasets that can be analyzed in different ways for each topic we end up selecting?

privefl · 2017-10-20T16:58:01Z

I think 1000 Genomes is super standard.

zhenyisong · 2017-10-20T16:58:32Z

It depends on our story-line. We analyze for what?

BrunoGrandePhD · 2017-10-20T17:09:29Z

@privefl: I agree that 1000G is pretty standard. Another option is the Genome in a Bottle, but that's a single sample and doesn't allow any cohort studies.

@zhenyisong: I think it would be too hard to maintain a perfectly uniform storyline across all tutorials. I think the best we can do is pick a uniform dataset (e.g. 1000G data) and analyze it in different ways to cover the various topics we want to develop tutorials for.

privefl · 2017-10-20T17:09:29Z

What I just discovered: http://googlegenomics.readthedocs.io/en/latest/

privefl · 2017-10-20T17:13:08Z

1000 Cannabis Genomes Project ahah

BrunoGrandePhD added the question label Oct 20, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Picking a dataset #3

Picking a dataset #3

BrunoGrandePhD commented Oct 20, 2017

privefl commented Oct 20, 2017

zhenyisong commented Oct 20, 2017

BrunoGrandePhD commented Oct 20, 2017 •

edited

Loading

privefl commented Oct 20, 2017

privefl commented Oct 20, 2017

Picking a dataset #3

Picking a dataset #3

Comments

BrunoGrandePhD commented Oct 20, 2017

privefl commented Oct 20, 2017

zhenyisong commented Oct 20, 2017

BrunoGrandePhD commented Oct 20, 2017 • edited Loading

privefl commented Oct 20, 2017

privefl commented Oct 20, 2017

BrunoGrandePhD commented Oct 20, 2017 •

edited

Loading