Title | ||
---|---|---|
Extracting lay paraphrases of specialized expressions from monolingual comparable medical corpora |
Abstract | ||
---|---|---|
Whereas multilingual comparable corpora have been used to identify translations of words or terms, monolingual corpora can help identify paraphrases. The present work addresses paraphrases found between two different discourse types: specialized and lay texts. We therefore built comparable corpora of specialized and lay texts in order to detect equivalent lay and specialized expressions. We identified two devices used in such paraphrases: nominalizations and neo-classical compounds. The results showed that the paraphrases had a good precision and that nominalizations were indeed relevant in the context of studying the differences between specialized and lay language. Neo-classical compounds were less conclusive. This study also demonstrates that simple paraphrase acquisition methods can also work on texts with a rather small degree of similarity, once similar text segments are detected. |
Year | Venue | Keywords |
---|---|---|
2011 | BUCC@ACL/IJCNLP | neo-classical compound,comparable corpus,monolingual corpus,monolingual comparable medical corpus,simple paraphrase acquisition method,specialized expression,similar text segment,good precision,different discourse type,present work address,multilingual comparable corpus,text segmentation |
Field | DocType | Citations |
Nominalization,Degree of similarity,Expression (mathematics),Computer science,Paraphrase,Natural language processing,Artificial intelligence,Linguistics | Conference | 13 |
PageRank | References | Authors |
1.03 | 16 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Louise Deleger | 1 | 234 | 20.13 |
Pierre Zweigenbaum | 2 | 773 | 85.43 |