Title
Compound Terms and Their Multi-word Variants: Case of German and Russian Languages
Abstract
The terminology of any language and any domain evolves continuously, leading to constant term renewal. Terms undergo a wide range of morphological and syntactic variations that any NLP application has to handle. While the syntactic variations of multi-word terms have been described and tools have been designed to process them, only a few works have studied the syntagmatic variants of compound terms. This paper is dedicated to the identification of such variants, and more precisely to the detection of synonymic pairs of the form "compound term - multi-word term". We describe a pipeline for their detection, from compound recognition and splitting, through multi-word term extraction, to alignment of the variants with the original terms. The experiments are carried out for two compound-producing languages, German and Russian, and two specialised domains: wind energy and breast cancer. We identify variation patterns for these two languages and demonstrate that the transformation of a morphological compound into a syntagmatic compound mainly occurs when the term denomination needs to be enlarged.
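The pipeline named in the abstract (compound splitting, then alignment of a compound with a multi-word variant) can be illustrated with a minimal sketch. This is not the authors' implementation: the lexicon, the linking elements, the stopword list, the function names `split_compound` and `is_variant`, and the example terms are all assumptions made for illustration, and the multi-word term is assumed to be already lemmatized.

```python
# Toy lexicon of component lemmas (assumption, for illustration only).
LEXICON = {"wind", "energie", "anlage", "brust", "krebs"}
# Linking elements that may join German compound parts (assumed list).
LINKERS = ("", "s", "es", "n", "en")
# Function words ignored when comparing components (assumed list).
STOPWORDS = {"der", "die", "das", "von", "aus"}


def split_compound(word, lexicon=LEXICON):
    """Return one split of `word` into lexicon components, or None."""
    word = word.lower()
    if word in lexicon:
        return [word]
    # Try the longest candidate head first, then recurse on the remainder.
    for i in range(len(word) - 1, 0, -1):
        head = word[:i]
        if head not in lexicon:
            continue
        rest = word[i:]
        for linker in LINKERS:
            if rest.startswith(linker):
                tail = split_compound(rest[len(linker):], lexicon)
                if tail:
                    return [head] + tail
    return None


def is_variant(compound, mwt_lemmas):
    """True if the multi-word term's content lemmas match the compound's parts."""
    parts = split_compound(compound)
    if parts is None:
        return False
    content = [t.lower() for t in mwt_lemmas if t.lower() not in STOPWORDS]
    return sorted(parts) == sorted(content)


if __name__ == "__main__":
    print(split_compound("Windenergieanlage"))
    print(is_variant("Windenergie", ["Energie", "der", "Wind"]))
```

A real system would replace the toy lexicon with corpus-derived components and would also have to handle variants that add or drop components, which exact matching of component sets does not cover.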
Year
2014
DOI
10.1007/978-3-642-54906-9_6
Venue
CICLing
Field
Noun phrase, Constant term, Terminology, Computer science, Syntagmatic analysis, Natural language processing, Artificial intelligence, Syntax, German
DocType
Conference
Volume
8403
ISSN
0302-9743
Citations
0
PageRank
0.34
References
5
Authors
2
Name, Order, Citations, PageRank
Elizaveta Clouet, 1, 0, 0.34
Béatrice Daille, 2, 306, 34.40