Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need precomputed database of crossovers and overlaps #110

Open
karlmsmith opened this issue Jun 12, 2017 · 0 comments
Open

Need precomputed database of crossovers and overlaps #110

karlmsmith opened this issue Jun 12, 2017 · 0 comments

Comments

@karlmsmith
Copy link
Contributor

Need to add something to SOCAT when adding or updating datasets that will automatically check for crossovers (good things - close matches between two unrelated datasets) as well as overlaps (bad things - duplications in a dataset or a series of datasets). These crossovers and overlaps need to be precomputed and stored in a database, and LAS then needs to report these values in the database instead of trying to do an on-the-fly calculation when the user presses a button.

The current on-the-fly crossover checker IN LAS has a known bug in that it uses the decimated data; otherwise it would be a lot slower. But this is missing a good number of crossovers. It may be using decimated data for one dataset and retrieves all the data for the other dataset which could explain the "one-way" crossover of 33GG20150708 with 316420141024. Another possibility is the code is not ignoring samples that are out-of-order in time and should have a WOCE-4 flag, so the code to limit the number of comparisons being made is jumping past the match in one of the directions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant