Multiple IDs conflict when adding to solr #61

Closed
talentoscope opened this issue Sep 14, 2016 · 1 comment

Comments

@talentoscope

When I add new XML files to a Solr instance using bzcat and the wiki extractor, the documents added to collection1 appear to be given the same ID, which causes problems when YodaQA cross-references links. As a result, YodaQA ends up parsing the wrong documents.
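One way to check whether IDs really collide across sources is to scan the extractor output before loading it into Solr. The following is only a minimal sketch, not part of YodaQA: it assumes each extracted article starts with a header line of the form `<doc id="..." url="..." title="...">`, and the function name `find_id_conflicts` and the source labels are hypothetical.

```python
import re
from collections import defaultdict

# Assumption: each article in the extractor output starts with a header line
# such as <doc id="12" url="..." title="Some title">.
DOC_HEADER = re.compile(r'<doc id="([^"]+)"[^>]*\btitle="([^"]*)"')


def find_id_conflicts(dumps):
    """Report document IDs that occur in more than one source.

    `dumps` maps a source label (hypothetical, e.g. "enwiki") to an iterable
    of extracted text lines. Returns {doc_id: [(source, title), ...]} for
    every ID that appears under at least two different source labels.
    """
    seen = defaultdict(list)
    for source, lines in dumps.items():
        for line in lines:
            match = DOC_HEADER.match(line)
            if match:
                doc_id, title = match.groups()
                seen[doc_id].append((source, title))
    # Keep only IDs shared by two or more distinct sources.
    return {
        doc_id: hits
        for doc_id, hits in seen.items()
        if len({source for source, _ in hits}) > 1
    }
```

If the result is non-empty, documents from different sources share an ID. Solr replaces an existing document when a new one arrives with the same unique key, so such collisions would be consistent with the symptoms described here.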

Would it be possible to provide instructions for adding new data sources, such as other wikis (Simple, Species, Wikiversity, etc.), since the live version seems to encompass other sources?

@pasky
Member

pasky commented Sep 21, 2016

Unfortunately, that would take quite some effort, which we are unable to spend right now. I'd propose that this is essentially a duplicate of #17. The default YodaQA version also uses only the enwiki data source for text; it just additionally does a Bing search (which is also part of the source).

@pasky closed this as completed Sep 21, 2016