Build Data Set If you want to build the data set used in my thesis, just open a console and type: git clone https://github.com/dh-thesis/build.git cd build ./init Prerequisites Git Python Virtualenv Firefox Gecko Driver See also dh-thesis/crawl dh-thesis/retrieve Result dh-thesis/base