Biomedical text retrieval in languages with a complex morphology - Citegraph

Paper Info

Title
Biomedical text retrieval in languages with a complex morphology

Abstract
Document retrieval in languages with a rich and complex morphology - particularly in terms of derivation and (single-word) composition - suffers from serious performance degradation with the stemming-only query-term-to-text-word matching paradigm. We propose an alternative approach in which morphologically complex word forms are segmented into relevant subwords (such as stems, named entities, acronyms), and subwords constitute the basic unit for indexing and retrieval. We evaluate our approach on a large biomedical document collection.

Year	DOI	Venue
2002	10.3115/1118149.1118158	ACL Workshop on Natural Language Processing in the Biomedical Domain
Keywords	DocType	Citations
complex word form,large biomedical document collection,complex morphology,biomedical text retrieval,serious performance degradation,basic unit,alternative approach,document retrieval,stemming-only query-term-to-text-word,relevant subwords	Conference	6
PageRank	References	Authors
0.70	14	3

Authors (3 rows)

Cited by (6 rows)

References (14 rows)

Name	Order	Citations	PageRank
Stefan Schulz	1	6	1.38
Martin Honeck	2	21	2.62
Udo Hahn	3	32	4.80

1