Statistical comparisons of non-deterministic IR systems using two dimensional variance | 3 | 0.42 | 2015 |
Evaluating non-deterministic retrieval systems | 3 | 0.49 | 2014 |
Improving test collection pools with machine learning | 4 | 0.39 | 2014 |
Extending test collection pools without manual runs | 4 | 0.41 | 2014 |
Approximate Recall Confidence Intervals | 7 | 0.57 | 2012 |
Principles for robust evaluation infrastructure | 12 | 0.65 | 2011 |
A similarity measure for indefinite rankings | 114 | 3.46 | 2010 |
EVIA 2010: the third international workshop on evaluating information access | 0 | 0.34 | 2010 |
Improvements that don't add up: ad-hoc retrieval results since 1998 | 87 | 6.22 | 2009 |
Score adjustment for correction of pooling bias | 23 | 0.85 | 2009 |
EvaluatIR: an online tool for evaluating and comparing IR systems | 16 | 1.16 | 2009 |
Has adhoc retrieval improved since 1994? | 7 | 0.91 | 2009 |
Statistical power in retrieval experimentation | 39 | 1.66 | 2008 |
Precision-at-ten considered redundant | 21 | 0.99 | 2008 |
Score standardization for inter-collection comparison of retrieval systems | 35 | 2.54 | 2008 |
Strategic system comparisons via targeted relevance judgments | 46 | 2.68 | 2007 |
A pipelined architecture for distributed text query evaluation | 77 | 2.26 | 2007 |
Load balancing for term-distributed parallel retrieval | 41 | 1.39 | 2006 |
Melbourne University at the 2006 Terabyte Track. | 1 | 0.34 | 2006 |
Melbourne University 2005: Enterprise and Terabyte Tasks | 1 | 0.40 | 2005 |
Space-Limited ranked query evaluation using adaptive pruning | 26 | 1.71 | 2005 |
RMIT University at TREC 2005: Terabyte and Robust Track | 8 | 0.65 | 2005 |
RMIT University at TREC 2004 | 10 | 0.82 | 2004 |