-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Interpretation of differentials #119
Comments
The end of your first paragraph is correct, I think -- all features should be reported in the differentials. The only reason features should be filtered is if they'd fail the I'll defer to @mortonjt on "a good rule of thumb for those which are considered most significant". |
Oof, sorry for letting this slip. I'll take a stab at answering your question about significance, with the caveat that @mortonjt (and @lisa55asil) are much more authoritative sources on this than I am. As far as I'm aware, there isn't really a way (...at least right now) to get something like a p-value describing the significance of a taxon's place in the differential rankings. Something worth considering is that we don't know (from relative data) how things are absolutely changing: it's possible for all (or at least most) taxa to be changing in a similar way across sample conditions -- an example of this is shown in the toothbrushing example from Fig. 2 of Songbird's paper, where basically everything is decreasing in absolute abundance from before to after brushing (albeit some taxa, e.g. Haemophilus, are decreasing more than others). This ties into this section of the paper, emphasis mine:
So, these sorts of cases demonstrate that attempting to classify certain features' rankings as "significant" or not can be tricky. This comment is kind of rambling so I'll try to wrap this up. In practice, manually looking at the rankings can be useful: depending on the formula you used in Songbird, the differential rankings you get should indicate features' associations with some metadata field(s) of interest to you. So taking the log-ratio of some of the top-ranked to bottom-ranked features is often a useful way to compare groups of samples: and from there, you can see what particular features (or types of features, e.g. features with similar taxonomic classifications) seem to be particularly influencing separation between samples' log-ratios. You can use normal statistical tests on these log-ratios, if you really want to get a measure of "significance" (personally I prefer just quoting effect sizes, but I'm sure other people will have different opinions :). Qurro was designed to make this general approach easier: this part of a tutorial shows one basic "workflow" for this, albeit with ALDEx2 instead of Songbird outputs. Anyway, hope this helps! |
@fedarko , Sincerely appreciate your answer! Pretty comprehensive and useful to me. Thanks!! |
@fedarko I think I understand what you're describing, but for clarification, when you say "personally I prefer just quoting effect sizes", you mean describing the log ratio as an effect size, i.e. fold change? It also leads me to another question. Traditionally, exploratory studies have described the distribution of taxa between sites or treatments at a convenient rank, like Class, Order, or Family to give an overview of results. Is it possible to aggregate the differentials in the same way? For example, would it be valid to use the mean/median differential or rank to convey that Orders A,B, and C were more abundant at Site 1 than Site 2? Or would a better approach be collapsing the biom table at some taxonomic rank (like tax_glom in phyloseq) and rerun the songbird model to get the differentials at the Order level? |
This was in reference to reporting differences in selected log-ratios (of ranked features) between samples -- an example of this is Figs. 1(D) and 2(D) from this paper, where we just reported the R^2 between Of course, you don't have to avoid p-values with this stuff... this forum thread goes into more detail.
My opinion is that the former approach (using the uncollapsed data, and then looking at where the members of Orders A, B, and C fall within the rankings) will be a better idea for most use cases, or at least the ones I can think of. There is often a lot of functional variation within a given taxonomic level (e.g. different E. coli strains doing wildly different stuff), so it might make sense that different features within Order A are present in multiple places in the rankings for a good reason. Figs. 1(A) and 2(A) in the paper I linked above are a nice example of this -- most of the Shewanella (red)-classified features are highly ranked (i.e. are associated with gill samples), but there's that one Shewanella feature over on the left side of the rankings. This feature's ranking and/or taxonomic classification could be a fluke, but it could also be occupying a different environment / serving a meaningfully different function than the other Shewanella features in the plot; and whatever the true reason, this is the sort of pattern that collapsing your data will probably mask. (That being said, if you explicitly don't care about this close of a resolution and just want to look at things at a set level, then I guess collapsing wouldn't be too bad.) These answers are getting kind of subjective, so ...these are my perspectives on these problems as a non-statistician and non-biologist, at least. @mortonjt please feel free to yell at me if I messed something up ;) |
Like always, thank you for the fast and thoughtful response! The compositional data world is still new to me, and every time I review the Songbird paper or Qurro paper I grasp a little more. I agree that keeping the high resolution of the uncollapsed data is the better way to go, and I think I understand now how I can move forward with creating a general description. I'm not a statistician either, so it's nice to get some feedback. Your subjectivity is partly what I was interested in because I'm sure I will hear lots of opinions in the review process :-) |
How do you decide which taxa are reported in the differential ranking? Are the differentials that are reported the coefficients that are the most changing, or those that are significant, or some combination of both? Or is it all taxa but reports as fewer than input bc of the
--min-feature-count
and--min-sample-count
. The text from the paper reads:But how songbird chooses which taxa to report in the
differentials.tsv
is still unclear to me. If it is those that are most changing, what is the cutoff for the coefficient? Is it a sparse model (eg within a standard error of accuracy reported from a nonsparse model)? Is it a hard cutoff of the top n taxa ranked by coefficient? Is there some kind of p-value reported for a cutoff? If it is reporting all features, is there a good rule of thumb for those which are considered most significant?In any case, I am curious how, in a paper's text, one would explain the choice to use the reported differentials as those that distinguish treatment(s).
I saw #83 but this is still unclear to me.
Thank you!
The text was updated successfully, but these errors were encountered: