Title
Experiments with N-Gram Prefixes on a Multinomial Language Model versus Lucene's Off-the-shelf Ranking Scheme and Rocchio Query Expansion (TEL@CLEF Monolingual Task).
Abstract
We describe our participation in the TEL@CLEF task of the CLEF 2009 ad-hoc track, where we measured the retrieval performance of LGTE, an index engine for Geo-Temporal collections which is mostly based on Lucene, together with extensions for query expansion and multinomial language modelling. We experiment an N-Gram stemming model to improve our last year experiments which consisted in combinations of query expansion, Lucene's off-the-shelf ranking scheme and the ranking scheme based on multinomial language modeling. The N-Gram stemming model was based in a linear combination of N-Grams, with N between 2 and 5, using weight factors obtained by learning from last year topics and assessments. The Rocchio ranking function was also adapted to implement this N-Gram model. Results show that this stemming technique together with query expansion and multinomial language modeling both result in increased performance.
Year
DOI
Venue
2009
10.1007/978-3-642-15754-7_10
CLEF (Working Notes)
Keywords
DocType
Volume
increased performance,ranking scheme,clef task,multinomial language,n-gram model,multinomial language modelling,rocchio query expansion,multinomial language modeling,clef monolingual task,n-gram prefix,off-the-shelf ranking scheme,query expansion,rocchio ranking function,language model,vector space model,indexation
Conference
6241
ISSN
ISBN
Citations 
0302-9743
3-642-15753-X
1
PageRank 
References 
Authors
0.37
5
3
Name
Order
Citations
PageRank
Jorge Machado131.12
Bruno Martins244134.58
José Luis Borbinha315120.02