Abstract | ||
---|---|---|
Document retrieval in languages with a rich and com- plex morphology - particularly in terms of derivation and (single-word) composition - suffers from serious perfor- mance degradation with the stemming-only query-term-to- text-word matching paradigm. We propose an alternative ap- proach in which morphologically complex word forms are segmented into relevant subwords (such as stems, prefixes, suffixes), and subwords constitute the basic unit for index- ing and retrieval. We evaluate our approach on a biomedical document collection. |
Year | Venue | Keywords |
---|---|---|
2003 | FLAIRS Conference | document retrieval,indexation |
Field | DocType | Citations |
Information retrieval,Computer science,Search engine indexing,Prefix,Artificial intelligence,Natural language processing,Document retrieval,Visual Word | Conference | 0 |
PageRank | References | Authors |
0.34 | 13 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Udo Hahn | 1 | 32 | 4.80 |
Stefan Schulz | 2 | 29 | 5.15 |