Allow searching for library compounds without mass spectrum #236

Open
niekdejonge opened this issue Mar 22, 2024 · 4 comments

Comments

@niekdejonge
Collaborator

@jdsmith145 (Josh Smith, from the Tomáš Pluskal lab) mentioned that it would be useful to find good matches from a list of metabolites already known for the analyzed species. These compounds often won't have a mass spectrum available, but they will be present in structural libraries.

What we could implement in MS2Query is a ranking system for compounds without spectra. I would implement this by taking the MS2Query approach: search for closely related library structures and calculate the MS2DeepScore against the spectra of all of them. The only difference is that the similarity with spectra of the candidate structure itself cannot be included, since no such spectra exist.

This would make it possible to do biologically informed searches, using public structural libraries. Might be interesting to implement for MS2Query 2.0.

Josh might be interested in providing us with some case studies for this approach.

@niekdejonge
Collaborator Author

So this would be a kind of targeted analogue search. I think quite a few people might be interested in something like this.

@justinjjvanderhooft

Interesting approach indeed! We would need some thresholds on structural similarity based on case studies to see what is workable, but definitely worth a try!
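
To make the threshold idea concrete, here is a minimal sketch of how a structural similarity cutoff could select the library neighbours of a candidate structure. It assumes fingerprints are given as sets of "on" bit indices and uses Tanimoto similarity; the function names and the 0.6 default are hypothetical placeholders, not values from MS2Query or from any case study.

```python
from typing import Dict, FrozenSet, List, Tuple


def tanimoto(a: FrozenSet[int], b: FrozenSet[int]) -> float:
    """Tanimoto similarity between two fingerprints given as sets of on-bits."""
    if not a and not b:
        return 0.0  # convention chosen here for two empty fingerprints
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def neighbours_above_threshold(
    candidate_fp: FrozenSet[int],
    library_fps: Dict[str, FrozenSet[int]],
    threshold: float = 0.6,  # hypothetical default, to be tuned on case studies
) -> List[Tuple[str, float]]:
    """Return (library_id, similarity) pairs at or above the threshold, best first."""
    hits = []
    for library_id, fp in library_fps.items():
        similarity = tanimoto(candidate_fp, fp)
        if similarity >= threshold:
            hits.append((library_id, similarity))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Toy fingerprints, purely illustrative.
library = {
    "lib_A": frozenset({1, 2, 3, 4}),
    "lib_B": frozenset({1, 2}),
    "lib_C": frozenset({7, 8}),
}
candidate = frozenset({1, 2, 3})
close_neighbours = neighbours_above_threshold(candidate, library, threshold=0.7)
```

A candidate with no library neighbours above the cutoff would get an empty list, which is one way to flag structures for which no reliable ranking can be made.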

@florian-huber
Member

I am not sure I understand what the approach would be.
Do you mean, searching for analogues with the additional constraint that they should be close to a list of known structures?

@niekdejonge
Collaborator Author

Close to that. Instead of predicting an analogue from the entire library, a ranking/prediction is made only for the provided list of structures. That prediction is still based on the MS2DeepScore scores between the query spectrum and the spectra of closely related structures in the library.
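
A rough sketch of what such a ranking could look like, assuming the MS2DeepScore values between the query spectrum and each library spectrum, and the structural similarities between each candidate and each library structure, have already been computed. The function name `rank_candidates`, the `top_k` parameter, and the similarity-weighted average are illustrative assumptions, not the actual MS2Query model.

```python
from typing import Dict, List, Tuple


def rank_candidates(
    spectral_scores: Dict[str, float],
    structural_sims: Dict[str, Dict[str, float]],
    top_k: int = 3,  # hypothetical number of library neighbours per candidate
) -> List[Tuple[str, float]]:
    """Rank spectrum-less candidate structures for one query spectrum.

    spectral_scores: library_id -> precomputed spectral similarity
        (e.g. an MS2DeepScore value) between the query and that library spectrum.
    structural_sims: candidate_id -> {library_id -> structural similarity}.
    Every library_id used in structural_sims must be present in spectral_scores.
    """
    ranking = []
    for candidate_id, sims in structural_sims.items():
        # Keep the top_k structurally closest library compounds.
        neighbours = sorted(sims.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
        total_weight = sum(sim for _, sim in neighbours)
        if total_weight == 0:
            score = 0.0  # no structurally related library compound at all
        else:
            # Similarity-weighted average of the neighbours' spectral scores.
            score = sum(sim * spectral_scores[lib] for lib, sim in neighbours) / total_weight
        ranking.append((candidate_id, score))
    return sorted(ranking, key=lambda item: item[1], reverse=True)


# Toy numbers, purely illustrative.
spectral_scores = {"lib1": 0.9, "lib2": 0.8, "lib3": 0.1}
structural_sims = {
    "candA": {"lib1": 0.8, "lib2": 0.7, "lib3": 0.1},
    "candB": {"lib1": 0.1, "lib2": 0.2, "lib3": 0.9},
}
ranking = rank_candidates(spectral_scores, structural_sims, top_k=2)
```

In this toy case `candA` ranks first because its closest library structures are the ones whose spectra resemble the query. A real implementation would likely feed such features into a trained model rather than use a plain weighted average.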
