Skip to content

How to run

lucasfijen edited this page May 28, 2019 · 10 revisions

Required packages

  • Python 3.6
  • Elasticsearch 5.6.3 (and it's python package)
  • WordCloud for Python3
  • Flask (http://flask.pocoo.org/)
  • p7zip (sudo apt-get install p7zip-full)

Steps to run

Step 1: make a few folders

In the the folder of the search engine make three folders:

  • dataset
  • json_files
  • json_bulks

Step 2: get a dataset

go to https://archive.org/details/stackexchange

download a few small categories from the 7z map (make sure you get the normal ones and not the meta files) place the .7z files in the dataset folder.

Step 3: parse the dataset to json bulks

  • run parse_to_json.py
  • then run bulk.py

Step 4: push to the server

  • make sure elastic search is active
  • run push_to_server.py

Step 5: open the search engine

  • run html/server.py
  • open browser and go to localhost:5000