Problem due to elastic search being limited to 10000 results #93

Open
cjmyers opened this issue Feb 18, 2021 · 2 comments

cjmyers commented Feb 18, 2021

Elasticsearch returns only the top 10000 scoring results. When these are intersected with the results matching the Virtuoso criteria, the intersection may contain fewer responses than the search needs (given its offset and limit). If Virtuoso also returned results that were filtered out because they were not among the Elasticsearch hits, then we may be missing responses whose scores fall below the top 10000 hits. To address this, we need to do one of the following:

  1. Fetch more Elasticsearch results until enough of them intersect with the Virtuoso results to fill the requested page.

OR

  2. Return the results that Virtuoso returns but Elasticsearch does not (caveat: these will not be sorted by page rank and may not actually match the search string).

OR

  3. Fall back to a full Virtuoso search that includes the search string.
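
The first option could look roughly like the sketch below. It is only an illustration under stated assumptions: `collect_page` and `fetch_batch` are hypothetical names, results are identified by URI strings, and `fetch_batch(start, size)` stands in for whatever mechanism pages through score-ordered Elasticsearch hits (plain `from`/`size` paging is itself capped by `index.max_result_window`, which defaults to 10000, so something like `search_after` would probably be needed past that point).

```python
from typing import Callable, List, Set


def collect_page(
    fetch_batch: Callable[[int, int], List[str]],  # hypothetical: returns score-ordered URIs
    allowed: Set[str],                             # URIs that satisfy the Virtuoso criteria
    offset: int,
    limit: int,
    batch_size: int = 10000,
) -> List[str]:
    """Keep fetching batches until offset + limit intersecting hits are found."""
    needed = offset + limit
    matched: List[str] = []
    start = 0
    while len(matched) < needed:
        batch = fetch_batch(start, batch_size)
        if not batch:
            break  # the text search has no more hits at all
        # Keep only hits that Virtuoso also returned, preserving score order.
        matched.extend(uri for uri in batch if uri in allowed)
        start += len(batch)
    return matched[offset:needed]


# Toy usage: 25 ranked hits, fetched 10 at a time.
ranked = [f"u{i}" for i in range(25)]
page = collect_page(lambda s, n: ranked[s:s + n], {"u3", "u12", "u24"}, 0, 2, batch_size=10)
```

The loop stops as soon as the page can be filled, so the common case still costs a single batch. The worst case walks the entire hit list.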

yu-eric commented Feb 25, 2021

@cjmyers See #95.


cjmyers commented Feb 25, 2021

  1. Insufficient number of results to return (fewer than the limit, taking the offset into account)
  2. Elasticsearch returns 10000 hits
  3. Virtuoso returns results that are not in the 10000 hits from Elasticsearch
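
Reading the three items above as conditions that must all hold before results can be missing (an assumption, since the comment does not say so explicitly), a check might be sketched as follows. The function name, its parameters, and the `ES_MAX_HITS` constant are hypothetical and not taken from the project code.

```python
from typing import Iterable, Sequence

ES_MAX_HITS = 10000  # assumed cap on the hits returned by the text search


def may_be_missing_results(
    es_hits: Sequence[str],
    virtuoso_uris: Iterable[str],
    offset: int,
    limit: int,
    es_cap: int = ES_MAX_HITS,
) -> bool:
    """True only when all three conditions from the list hold at once."""
    es_set = set(es_hits)
    virtuoso_set = set(virtuoso_uris)
    # 1. Too few intersecting results to fill the requested page.
    insufficient = len(es_set & virtuoso_set) < offset + limit
    # 2. The text search hit its cap, so lower-scoring hits may exist.
    capped = len(es_hits) >= es_cap
    # 3. Virtuoso returned something absent from the text-search hits.
    outside = bool(virtuoso_set - es_set)
    return insufficient and capped and outside
```

When this returns `False`, the intersection can be trusted as is, and a fallback is only worth its cost when it returns `True`.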

Projects
Status: Backlog