Skip to content

Latest commit

 

History

History
11 lines (8 loc) · 368 Bytes

README.md

File metadata and controls

11 lines (8 loc) · 368 Bytes

spectral_jaccard_similarity

Dependencies

  • mmh3 hash library
  • SimpleFastaParser from Bio.SeqIO.FastaIO
  • tqdm progress bars

Instructions

To create filtered datasets run Bacterial_pipeline_part1.py and Bacterial_pipeline_part2.py.

To use our pre-filtered datasets first run makeFolders.py then run pipeline_wrapper.py. All the code is encapsulated in there.