Skip to content

dljve/IR

Repository files navigation

Information Retrieval

Indexes

Converting ms-marco.trec.gz to lucene index:

python -m pyserini.index -collection CleanTrecCollection -generator DefaultLuceneDocumentGenerator -threads 1 -input C:/msmarco-docs.trec -index C:/msmarco-doc/lucene-index.msmarco-doc.pos+docvectors+rawdocs -storePositions -storeDocvectors -storeRaw

where -input specifies the folder with the file ms-marco.trec.gz and -index specifies the output folder. Pyserini needs to be installed.

Evaluation

To use trec_eval.exe, cygwin must be installed and C:/cygwin64/bin must be added to the system and environment PATH variable.

Links

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published