-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how can I create the input features for my own dataset by running the py. file? Is it possible to simply the input to only sequence of the protein? #9
Comments
Yes. The input feature generation is similar to AF2. A sample shell script is here for individual sequences. |
What about this .py file: run_af2c_fea.py, which is also said to be used to get features? |
The shell script calls this python script you referred to do the job. |
Could you please give an example of the script on colab just to generate the features of a protein sequence from uniprot? |
The Colab notebook we provided only takes In this example, you have a heterodimer HgcAB composed of two monomers, HgcA and HgcB. Organizes the feature input as the following:
Then tar this folder into a single tarball and upload it to our notebook. Note that our code can take gzipped pickle files directly. It is up to you whether or not to gzip the pickle files before you make the tarball. After you upload the attached tarball, you may run a test to predict a heterodimer using the target syntax: |
Thank you for the instruction! I have successfully got the features from other AF notebook(it used 0 sequence template and I am not sure whether it would matter compared to your examples) and did prediction on some protein complexes. However it seems to have a very high false negative rate(the proteins were supposed to be interacting but the output was not), is there any way to improve that? |
Many things to try, such as:
|
@FreshAirTonight Due to my limited resources, I'm focusing on the GPU-based second step and considering tools like ColabFold to create the feature.pkl files. What files and steps are needed to create the feature.pkl file? Is there a tool available on Google Colab or Kaggle for this? |
@fereidoon27 An example of feature generation script You may use precomputed MSAs as well. Under the intended output folder of a protein, create a subfolder named |
No description provided.
The text was updated successfully, but these errors were encountered: