Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GFR: Add series to data source for keyword search #16

Open
clingerman opened this issue Jan 24, 2017 · 1 comment
Open

GFR: Add series to data source for keyword search #16

clingerman opened this issue Jan 24, 2017 · 1 comment

Comments

@clingerman
Copy link
Member

Add series as a data source for the results pulled back in a keyword search.

@DominicBM
Copy link
Contributor

@clingerman: I thought about/looked into this, but I don't think it is logically possible with the way the catalog is set up. Since the record groups don't contain the series-level metadata, we'd have to search all RGs, collections, and series as well, even though we are only doing this to find the RGs/collections with matches in their child series. What this means is every series result would essentially just represent a hit for its RG/collection, but then there could be many hits in the series-level records for the same RG/collection.

So, rather than searching a small set (few hundred records) of unique records as it does currently, we are searching a set of hundreds of thousands of records, and we'd have to scan through the entire set each time to extract the record group numbers/collection IDs for every result and then de-duplicate the list to generate the results on the GFR's end. If you search for a common term like "army", you'll get 58,000+ results at the series level, which makes that workflow basically impossible. Even if we weren't limited by pagination, it would take a very long time to load all of those results to generate the GFR search result set from them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants