Title
Annotation of specialized corpora using a comprehensive entity and relation scheme.
Abstract
Annotated corpora are essential resources for many applications in Natural Language Processing. They provide insight on the linguistic and semantic characteristics of the genre and domain covered, and can be used for the training and evaluation of automatic tools. In the biomedical domain, annotated corpora of English texts have become available for several genres and subfields. However, very few similar resources are available for languages other than English. In this paper we present an effort to produce a high-quality corpus of clinical documents in French, annotated with a comprehensive scheme of entities and relations. We present the annotation scheme as well as the results of a pilot annotation study covering 35 clinical documents in a variety of subfields and genres. We show that high inter-annotator agreement can be achieved using a complex annotation scheme.
Year
Venue
Keywords
2014
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
Annotation,Clinical Texts,Natural Language Processing
Field
DocType
Citations 
Annotation,Information retrieval,Computer science,Artificial intelligence,Natural language processing
Conference
5
PageRank 
References 
Authors
0.40
10
5
Name
Order
Citations
PageRank
Louise Deleger123420.13
Anne-Laure Ligozat29822.95
Cyril Grouin317030.22
Pierre Zweigenbaum477385.43
Aurélie Névéol556550.50