Abstract | | |
---|---|---|
This paper describes our approach to the 2006 Ad Hoc Monolingual Information Retrieval run for French. The goal of our experiment was to compare the performance of a proposed statistical stemmer with that of a rule-based stemmer, specifically the French version of Porter's stemmer. The statistical stemming approach is based on lexicon clustering using a novel string distance measure. We submitted three official runs in addition to a baseline run that uses no stemming. The results show that, as expected, stemming significantly improves retrieval performance, by about 9-10%, and that the performance of the statistical stemmer is comparable to that of the rule-based stemmer. |
Field | Value |
---|---|
Year | 2006 |
DOI | 10.1007/978-3-540-74999-8_14 |
Venue | CLEF (Working Notes) |
Keywords | french version,lexicon clustering,rule-based stemmer,monolingual french retrieval,statistical stemmer,retrieval performance,adhoc monolingual information retrieval,proposed statistical stemmer,baseline run,novel string distance measure,rule based,information retrieval |
DocType | Conference |
Volume | 4730 |
ISSN | 0302-9743 |
ISBN | 3-540-74998-5 |
Citations | 4 |
PageRank | 0.48 |
References | 3 |
Authors | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Prasenjit Majumder | 1 | 173 | 25.15 |
Mandar Mitra | 2 | 3092 | 338.20 |
Kalyankumar Datta | 3 | 50 | 3.60 |