Implementations of Data Fusion algorithms for Information Retrieval.
This repository contains implementations (in Perl 5) of the following algorithms, of which I am an original co-author:
-
ProbFuse:
- A Probabilistic Approach to Data Fusion. David Lillis, Fergus Toolan, Rem Collier and John Dunnion. In Proceedings of the 29th annual international ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '06), pages 139--146, Seattle, WA, USA, 2006. ACM (http://dx.doi.org/10.1145/1148170.1148197).
- Probabilistic Data Fusion on a Large Document Collection. David Lillis, Fergus Toolan, Rem Collier and John Dunnion. Artificial Intelligence Review, 26(1-2):23--34, 2006 (doi: http://dx.doi.org/10.1007/s10462-007-9037-2)
-
SlideFuse:
- Extending Probabilistic Data Fusion Using Sliding Windows. David Lillis, Fergus Toolan, Rem Collier and John Dunnion. In C. Macdonald, I. Ounis, V. Plachouras, I. Ruthven, and R. W. White, editors, Advances in Information Retrieval. Proceedings of the 30th European Conference on Information Retrieval Research (ECIR 2008), volume 4956 of Lecture Notes in Computer Science, pages 358--369. Springer Berlin Heidelberg, Berlin, 2008 (doi: http://dx.doi.org/10.1007/978-3-540-78646-7_33).
-
MAPFuse:
- Estimating Probabilities for Effective Data Fusion. David Lillis, Lusheng Zhang, Fergus Toolan, Rem W. Collier, David Leonard and John Dunnion. In Proceedings of the 33rd Annual ACM SIGIR Conference on Research and Development in Information Retrieval, pages 347--354, Geneva, Switzerland, 2010. ACM (doi: http://dx.doi.org/10.1145/1835449.1835508).
It also offers some implementations of other fusion algorithms.
All are intended to work with topfiles and qrels offered by TREC.
Documentation is a little sketchy: questions by email to < dave /at/ lill /dot/ is >.