-
I want transcriptome FASTA sequences from the NCBI nucleotide database (BioProject PRJNA74409, NCBI TSA accession numbers JR970414–JR999999 and JT000001–JT023377). Is there a programmatic way to grab all of these sequences at once? I found this link that suggests using eutils, but I'd have to create a comma-separated list of 52963 accession numbers, and the NCBI Eutils page suggests using "the HTML POST method" if searching for more than 200 accessions (I also have no idea what they mean by HTML POST method). Any suggestions/insights on how to get these sequences easily? |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 1 reply
-
This notebook post might be helpful (it's very similar, but not exactly what you're doing): Look it over and see if you have more questions and I'll be happy to help! |
Beta Was this translation helpful? Give feedback.
-
Actually, you can ignore my previous post. Do the following:
Then, send all of the info to FastA format: |
Beta Was this translation helpful? Give feedback.
-
@kubu4 Thank you! If only the NCBI website wasn't a dumpster fire I probably would have figured it out myself 😂 |
Beta Was this translation helpful? Give feedback.
Actually, you can ignore my previous post. Do the following:
Then, send all of the info to FastA format: