Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Predicting model for CM and MDH dataset #31

Open
Tsinghua-gongjing opened this issue Apr 27, 2023 · 2 comments
Open

Predicting model for CM and MDH dataset #31

Tsinghua-gongjing opened this issue Apr 27, 2023 · 2 comments

Comments

@Tsinghua-gongjing
Copy link

Hi, thank you for the beautiful work.

Porgen has been applied to generate proteins for CM and MDH families. In the Method section, the details are described as:

We computed the AUC in receiver operating characteristic (ROC) curves for predicting binary function labels from model scores. We computed model scores for each sequence in both CM and MDH by using the per-token model log-likelihood in Eq. 2.

Does this mean: (1) for each sequence the log-likelihood is calculated for each token and (2) then a classifier model is employed to predict whether the whole sequence is reactive or not (the label is from experimental data). The features are the calculated log-likelihood score for each token. Could you please also release data/codes/models for this part?

Best regards

@donglg1309
Copy link

Could you please release the corresponding data for this generation of CM/MDH?

Best,
Liguo

@wenyuhaokikika
Copy link

I can not understand what is GB1 (top100avg) ? how to calculate?

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants