-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hurdle Model on DIA-NN Data #65
Comments
I'll only answer the part I'm more familiar with. The hurdle model tests specifically for differential detection, and in the absence thereof, differential abundance. proDA uses a dropout model that models the probability to missing a feature based on its abundance, and then uses this to test for differential abundance (even when no abundances have been measured in one condition). I don't remember what DEqMS does, and haven't used it. |
Thanks, @lgatto! This is helpful. I'll await your colleagues guidance regarding application of msqrob2 to DIA-NN outputs. Looking forward to trying it out. |
@ococrook I just wanted to circle back and see if you had some insight into this query with regards to DIA-NN outputs as I'd love to be able to use your fantastic tool? |
@ococrook Hope you are doing well. I wanted to check in and see if you had a chance to review my query so that I may utilize your wonderful tool? |
Hi! Sorry for delayed response, yes I would think that's a sensible model input. I would ask @lievenclement to clarify though as I didn't develop the tool |
Thanks, @ococrook! @lievenclement any feedback/thoughts on the use of DIA-NN data as inputs for your wonderful tool? |
Hi @abadgerw, I'm answering on behalf of Lieven. Many apologies for our late reply. We are happy to read your interest in using As mentioned by Laurent, the hurdle approach will compute 2 models: one model for differential abundance (using observed intensity data) and one model for differential detection (using feature count data). In order to use this package, you need to first process your data with
A sensible workflow for DIA-NN data would be to start with the precursor-level data and to aggregate to proteins. However, we still need to investigate how to specify the count-based model for DIA-NN data, for instance we should take into account the differences in protein detection rates across samples, but how to compute these rates is sill unclear to us. Hence, I would consider msqrob's hurdle model on DIA-NN data as experimental. Our schedules are cramped until end of the year, but we plan to work on this by the start of 2025. I hope this can help. |
@lievenclement @lgatto @ococrook Thank you for a great tool! I am looking to fit a hurdle model on data I have run through DIANN. In order to do this, should I be utilizing the protein intensity values and the number of precursors mapped to that protein as inputs to the model?
In addition, for my learning, what is the difference between the hurdle model approach and the approaches used by DEqMS and proDA that also seem to model peptide counts/missingness?
The text was updated successfully, but these errors were encountered: