Title
Statistical vs. Rule-Based Stemming for Monolingual French Retrieval.
Abstract
This paper describes our approach to the 2006 Adhoc Monolingual Information Retrieval run for French. The goal of our experiment was to compare the performance of a proposed statistical stemmer with that of a rule-based stemmer, specifically the French version of Porter's stemmer. The statistical stemming approach is based on lexicon clustering, using a novel string distance measure. We submitted three official runs, besides a baseline run that uses no stemming. The results show that stemming significantly improves retrieval performance (as expected) by about 9-10%, and the performance of the statistical stemmer is comparable with that of the rule-based stemmer.
Year
DOI
Venue
2006
10.1007/978-3-540-74999-8_14
CLEF (Working Notes)
Keywords
DocType
Volume
french version,lexicon clustering,rule-based stemmer,monolingual french retrieval,statistical stemmer,retrieval performance,adhoc monolingual information retrieval,proposed statistical stemmer,baseline run,novel string distance measure,rule based,information retrieval
Conference
4730
ISSN
ISBN
Citations 
0302-9743
3-540-74998-5
4
PageRank 
References 
Authors
0.48
3
3
Name
Order
Citations
PageRank
Prasenjit Majumder117325.15
Mandar Mitra23092338.20
Kalyankumar Datta3503.60