Skip to content

Solr Instance with Indexed Wikipedia Subset

Digo edited this page Nov 17, 2014 · 4 revisions

Solr Instance with Indexed Wikipedia Subset

This folder contains a Solr server with an indexed Wikipedia subset about world history.

This solr instance has been used in CMU baseline's system for QALab: https://github.com/oaqa/ntcir-qalab-cmu-baseline

There is a running instance hosted at a CMU server: http://ur.lti.cs.cmu.edu:8985/solr/

Download

Click the "download" icon from the top bar: https://drive.google.com/file/d/0B551g4MfkZQCZjFLdGo5b3hBUEE

A csv dump of un-indexed document texts and metadata is also available for downloading: https://drive.google.com/file/d/0B551g4MfkZQCOElMS0FxQ3g5aDA

Usage

#To unzip the file
$ unzip solr-wikipedia-history.zip
# To start this Solr instance at your own machine:
$ cd ./solr-wikipedia-history/example/
$ java -Dsolr.solr.home=multicore -jar start.jar

Then check instance from the browser: http://localhost:8985/solr/#/history

If you want to use your own solr instance with CMU's baseline system, you need to replace the SOLR_SERVER_URL with your own server's url in the SimpleHypothesisScorer descriptor:

https://github.com/oaqa/ntcir-qalab-cmu-baseline/blob/master/src/main/resources/annotators/SimpleHypothesisScorer.xml

License

Apache 2.0