WIP: batched synchronous zip writing #7

Closed
wants to merge 5 commits into from

bluegenes
Collaborator

No description provided.

@bluegenes
Collaborator Author

Benchmarking:

| software/version | command | acc | details | time | max RAM |
| --- | --- | --- | --- | --- | --- |
| v0.1.0 | gbsketch | 9 | fungal | 6min | 156 MB |
| sync-zip branch | gbsketch | 9 | fungal | 2min | 190 MB |

9 fungal accessions total

/usr/bin/time -v sourmash scripts gbsketch update.20240503-fungi.head10.csv -o test10.zip --failed test10.failed.csv -r 1 --param-str dna,k=31,scaled=1000 --genomes-only

this branch:

Loaded 9 rows in total
Writing manifest
...gbsketch is done! Sigs in 'test10.zip'. Fastas in '.'.
        Command being timed: "sourmash scripts gbsketch update.20240503-fungi.head10.csv -o test10.zip --failed test10.failed.csv -r 1 --param-str dna,k=31,scaled=1000 --genomes-only"
        User time (seconds): 5.74
        System time (seconds): 0.94
        Percent of CPU this job got: 5%
        Elapsed (wall clock) time (h:mm:ss or m:ss): 1:57.22
        Average shared text size (kbytes): 0
        Average unshared data size (kbytes): 0
        Average stack size (kbytes): 0
        Average total size (kbytes): 0
        Maximum resident set size (kbytes): 189108
        Average resident set size (kbytes): 0
        Major (requiring I/O) page faults: 68
        Minor (reclaiming a frame) page faults: 79681
        Voluntary context switches: 26612
        Involuntary context switches: 587
        Swaps: 0
        File system inputs: 10224
        File system outputs: 2088
        Socket messages sent: 0
        Socket messages received: 0
        Signals delivered: 0
        Page size (bytes): 4096
        Exit status: 0

main:

Command being timed: "sourmash scripts gbsketch update.20240503-fungi.head10.csv -o test10.zip --failed test10.failed.csv -r 1 --param-str dna,k=31,scaled=1000 --genomes-only"
        User time (seconds): 5.98
        System time (seconds): 1.05
        Percent of CPU this job got: 1%
        Elapsed (wall clock) time (h:mm:ss or m:ss): 5:59.30
        Average shared text size (kbytes): 0
        Average unshared data size (kbytes): 0
        Average stack size (kbytes): 0
        Average total size (kbytes): 0
        Maximum resident set size (kbytes): 156624
        Average resident set size (kbytes): 0
        Major (requiring I/O) page faults: 112
        Minor (reclaiming a frame) page faults: 66385
        Voluntary context switches: 30159
        Involuntary context switches: 2821
        Swaps: 0
        File system inputs: 19776
        File system outputs: 2160
        Socket messages sent: 0
        Socket messages received: 0
        Signals delivered: 0
        Page size (bytes): 4096
        Exit status: 0
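
For anyone re-running this comparison, here is a minimal sketch of how the summary numbers can be pulled out of the two `/usr/bin/time -v` reports above. The helper names (`elapsed_to_seconds`, `summarize`) are hypothetical and not part of sourmash or this plugin, and dividing the reported kbytes by 1000 to get MB is an assumption about how the figures were rounded.

```python
import re

# Patterns for the two fields of `/usr/bin/time -v` output used in the summary.
ELAPSED_RE = re.compile(r"Elapsed \(wall clock\) time \(h:mm:ss or m:ss\): (\S+)")
MAXRSS_RE = re.compile(r"Maximum resident set size \(kbytes\): (\d+)")


def elapsed_to_seconds(text):
    """Convert an 'h:mm:ss' or 'm:ss' string (seconds may be fractional) to seconds."""
    seconds = 0.0
    for part in text.split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


def summarize(time_output):
    """Return wall-clock seconds and peak RSS in MB (assumed kbytes / 1000)."""
    elapsed = ELAPSED_RE.search(time_output).group(1)
    max_rss_kb = int(MAXRSS_RE.search(time_output).group(1))
    return {
        "seconds": elapsed_to_seconds(elapsed),
        "max_rss_mb": max_rss_kb / 1000,
    }


# Values copied from the two reports above.
branch = summarize("""
        Elapsed (wall clock) time (h:mm:ss or m:ss): 1:57.22
        Maximum resident set size (kbytes): 189108
""")
main = summarize("""
        Elapsed (wall clock) time (h:mm:ss or m:ss): 5:59.30
        Maximum resident set size (kbytes): 156624
""")

speedup = main["seconds"] / branch["seconds"]
print(f"this branch: {branch['seconds']:.2f} s, {branch['max_rss_mb']:.1f} MB")
print(f"main:        {main['seconds']:.2f} s, {main['max_rss_mb']:.1f} MB")
print(f"speedup:     {speedup:.2f}x")
```

On these numbers the branch is roughly three times faster in wall-clock time, at the cost of about 30 MB more peak memory.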

@bluegenes
Collaborator Author

Closing in favor of #8

@bluegenes bluegenes closed this May 8, 2024
@bluegenes bluegenes deleted the sync-zip branch July 15, 2024 18:48