Title
Extracting lay paraphrases of specialized expressions from monolingual comparable medical corpora
Abstract
Whereas multilingual comparable corpora have been used to identify translations of words or terms, monolingual corpora can help identify paraphrases. The present work addresses paraphrases found between two different discourse types: specialized and lay texts. We therefore built comparable corpora of specialized and lay texts in order to detect equivalent lay and specialized expressions. We identified two devices used in such paraphrases: nominalizations and neo-classical compounds. The results showed that the paraphrases had a good precision and that nominalizations were indeed relevant in the context of studying the differences between specialized and lay language. Neo-classical compounds were less conclusive. This study also demonstrates that simple paraphrase acquisition methods can also work on texts with a rather small degree of similarity, once similar text segments are detected.
Year
Venue
Keywords
2011
BUCC@ACL/IJCNLP
neo-classical compound,comparable corpus,monolingual corpus,monolingual comparable medical corpus,simple paraphrase acquisition method,specialized expression,similar text segment,good precision,different discourse type,present work address,multilingual comparable corpus,text segmentation
Field
DocType
Citations 
Nominalization,Degree of similarity,Expression (mathematics),Computer science,Paraphrase,Natural language processing,Artificial intelligence,Linguistics
Conference
13
PageRank 
References 
Authors
1.03
16
2
Name
Order
Citations
PageRank
Louise Deleger123420.13
Pierre Zweigenbaum277385.43