Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inquiry on Using Mouse Gene Data with scregclust for Regulatory Module Analysis #2

Open
maaa-a opened this issue Nov 12, 2024 · 2 comments

Comments

@maaa-a
Copy link

maaa-a commented Nov 12, 2024

Hello! I am using the scregclust package to analyze regulatory modules in single-cell RNA-seq data from mice, but I encountered an issue. Since scregclust_format defaults to using human-specific lists for transcription factors (TFs) or kinases, I cannot identify any regulators in my mouse data, which leaves the is_regulator vector empty.

Could you recommend a method or alternative regulatory gene list suitable for mouse data? Alternatively, is there a way to customize the transcription factor list to use scregclust with mouse-specific regulatory information?

Thank you very much for your time and for developing this powerful tool. I look forward to any advice or recommendations you might have.

@cyianor
Copy link
Member

cyianor commented Nov 15, 2024

In our work we only considered human scRNA-seq data and the builtin regulator lists are biased towards that. You are always free to supply your own list. The is_regulator input is simply a logical vector or a 0/1 vector where 1/TRUE indicates that a gene is a tentative regulator and 0/FALSE is a target gene. So if you have a custom list of TFs or kinases or other regulators you'd like to consider, you can simply create this vector yourself and supply it to the call to scregclust.

@idacharlottalarsson
Copy link
Collaborator

idacharlottalarsson commented Nov 18, 2024

Hi! Just adding to Felix's answer. Using your own list of mouse-specific transcription factors to create the is_regulator vector is the best way to go and I've done this previously using the list provided here (https://resources.aertslab.org/cistarget/tf_lists/). The code would look something like this:

out <- scregclust_format(z, mode = "TF") #z is your gene expression matrix (genes x cells)
genesymbols <- out$genesymbols 
sample_assignment <- out$sample_assignment
is_regulator <- out$is_regulator #this will be only 0s

ix<-which(genesymbols %in% mouseTF) #mouseTF is the list of mouse transcription factors above
is_predictor[ix]<-1

Then you should be able to continue as usual :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants