Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How are feature groups grouped together (rank_feat_groups_matrixplot)? #73

Open
hildemann opened this issue Nov 25, 2020 · 1 comment
Open

Comments

@hildemann
Copy link

Hi @DaneseAnna ,

now the rank_feature_groups_matrixplot is working however i don't quite understand the process of feature groups being grouped together.
Because most likely this:
epi.pl.rank_feat_groups(adata, feature_symbols='transcript_annotation')
groups features by their variability score right? Because i really can't make any sense of the '3 vs rest' or '10 vs rest' figures this command generates.
And then does it just create 14 groups of certain size with similiar high variability score?

matrixplot_

And then these feature groups are written on top of the rank_feature_matrixplot, right. With the corresponding genes of features on the lower x axis.
To make more sense of this plot i specified grouby='cell_type' which at least made the y axis interpretable.

Thanks for advice,

Valentin

@DaneseAnna
Copy link
Collaborator

Hi Valentin,

You probably clustered your data before running epi.tl.rank_features. It provided you with a set of clusters (13 according to this plot), cells that share a lot of similarities. Then you ran epi.tl.rank_features(adata, groupby="louvain", reference='rest') which performedmultiple t-test (with benjamini-hochberg as a p-value correction method) to identify the most variable features in each of your Louvain cluster compared to all the remaining cells.

So you get in the columns labeled 3 in the heat map the 10 most differentially open features in the cells clustered as 3 with Louvain. And by the look of it, it corresponds to pDC cells.
Now, what you see as feature name is not the transcript annotation. It is probably because you are using raw features to identify the differential peaks and they don't have the transcript annotation. One way to fix it would be to remove the raw layer.

Best,
Anna

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants