Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Has the link expired? I did not find the dbscSNV.zip data file. #3

Open
Licko0909 opened this issue Feb 28, 2019 · 5 comments
Open

Comments

@Licko0909
Copy link

Licko0909 commented Feb 28, 2019

Hello, I want to reproduce the four models in the kipoi manuscript,but according to the download link, I did not find the dbscSNV.zip data file.

image

According to the instructions, to reproduce "tf-binding - Figure 2", I need to first enter the "/src/splicing" path and execute "snakemake" to reproduce "splicing - Figure 5", as shown in the following two figures:

image

image

Later, I implemented it as required, and finally this problem occurred:

  1. After executing snakemake, everything goes well until the dbscSNV.smk part is run. The error is displayed on line 101 in /manuscript/src/splicing/dbscSNV.smk. The zip file cannot be obtained. Does the file ftp://dbnsfp:[email protected]/dbscSNV.zip exist?

image

You can try it,Wget ftp://dbnsfp:[email protected]/dbscSNV.zip

  1. In addition, I found that line 24 in /src/splicing/Snakefile has written "/data".
    image

Last question: can I just run src/splicing/download_annotation.smk to download the GFT and FASTA data needed to reproduce "tf binding - Fig2"? How to run the .smk file? Because "snakemake" that executes "splicing - Fig5" now appears above "No such file dbscSNV.zip"

Sincerely hope to get your answer, I look forward to it! thank!

@Licko0909 Licko0909 changed the title I want to reproduce the four models in the kipoi manuscript, I did not find the dbscSNV.zip data file. Has the link expired? I did not find the dbscSNV.zip data file. Feb 28, 2019
@Avsecz
Copy link
Contributor

Avsecz commented Mar 1, 2019

  1. Try an alternative link for dbscSNV from https://drive.google.com/uc?id=0B60wROKy6OqcZkw2bWt2TGU5NDA&export=download
  2. I fixed the data/data line. Thanks for reporting!
  3. Yes. You can now just download the annotations now using snakemake download_annotations in the src/splicing folder

@Licko0909
Copy link
Author

  1. Try an alternative link for dbscSNV from https://drive.google.com/uc?id=0B60wROKy6OqcZkw2bWt2TGU5NDA&export=download
  2. I fixed the data/data line. Thanks for reporting!
  3. Yes. You can now just download the annotations now using snakemake download_annotations in the src/splicing folder

Thank!I successfully downloaded the dbscSNV.zip file according to the link you gave, and modified the corresponding code of the snakefile to ensure the smooth execution of the code. As I expected, there is no longer any error about dbscSNV, but the problem finally appears,
image
image
image
image

Based on the error message, I was thinking about whether there was a problem with one of the input files here. But I guarantee that the way these input files are obtained is either executed by snakemake in splicing or obtained by git clone.

@Avsecz
Copy link
Contributor

Avsecz commented Mar 3, 2019

Try to remove the .fai or .tbi files of the reference genome fasta file and the vcf file(s). Maybe the index files were still present in the shared data and should get re-generated automatically after you'll remove them.

@Licko0909
Copy link
Author

Try to remove the .fai or .tbi files of the reference genome fasta file and the vcf file(s). Maybe the index files were still present in the shared data and should get re-generated automatically after you'll remove them.

Wow!It‘s working now!,thank you very much!!!

@Licko0909
Copy link
Author

Try to remove the .fai or .tbi files of the reference genome fasta file and the vcf file(s). Maybe the index files were still present in the shared data and should get re-generated automatically after you'll remove them.

Yesterday, according to your prompt, I deleted the 20180429.filtered.vcf.gz.tbi file, and then I can successfully execute snakemake, and generated the following file:
image

It is running successfully and lasted for 30min
image

After running for 30 minutes, it stopped at 83% of the location. It reported "no dbscSNV.zip file". But I downloaded it from your link the previous day and stored it in "data/raw/splicing/dbscSNV"

image

I found that it automatically deleted data/raw/splicing/dbscSNV/dbscSNV.zip and delete the other files obtained by the pressure.

Later I re-downloaded the dbscSVN.zip file and placed it in the specified path.

The result is still an error, the execution log is as follows:
image

I found that "Job counts" has changed from 12 to 2!

image

I don't know if it is because of "cd ~/manuscript"

In order to solve the problem, I manually unpack dbscSNV.zip, leaving the required dbscSNV.chr1 and dbscSNV.zip, and executing snakemak into /src/splicing, still reporting the error.

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants