Title
Reducing Human Effort in Named Entity Corpus Construction Based on Ensemble Learning and Annotation Categorization
Abstract
Annotated named entity corpora play a significant role in many natural language processing applications. However, annotation by humans is time-consuming and costly. In this paper, we propose a high recall pre-annotator which combines multiple existing named entity taggers based on ensemble learning, to reduce the number of annotations that humans have to add. In addition, annotations are categorized into normal annotations and candidate annotations based on their estimated confidence, to reduce the number of human corrective actions as well as the total annotation time. The experiment results show that our approach outperforms the baseline methods in reduction of annotation time without loss in annotation performance (in terms of F-measure).
Year
DOI
Venue
2016
10.1007/978-3-319-50496-4_22
Lecture Notes in Computer Science
Keywords
DocType
Volume
Corpus construction,Named Entity Recognition,Assisted annotation,Ensemble learning
Conference
10102
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
8
3
Name
Order
Citations
PageRank
Tingming Lu101.01
Man Zhu211.02
Zhiqiang Gao334939.84