-
Notifications
You must be signed in to change notification settings - Fork 80
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
MRG: fix duplicate md5 in picklist problem (#2747)
This PR changes picklists using `manifest`, `prefetch`, `search`, and `gather` coltypes to use a composite tuple of `(ident, md5short)` for selection. This should have essentially zero false positives, unlike the current behavior (which relies solely on md5). The basic problem solved: when two signatures with different signature metadata (name, etc) but the same md5sum were in a single index, grepping/extracting via metadata (e.g. with a name) would return all signatures with matching md5sum. This PR fixes this behavior, and also cleans up `SignaturePicklist` a fair bit. The only potential downsides: * the memory size of picklists will increase because it is storing `ident` not just `md5` * `name`/`ident` takes on more official meaning in sourmash (as in, they should be meaningful or at least distinct) Fixes #2617 Fixes #2593 Closes #2602
- Loading branch information
Showing
8 changed files
with
261 additions
and
63 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.