Title
A transition-based joint model for disease named entity recognition and normalization.
Abstract
Motivation: Disease named entities play a central role in many areas of biomedical research, and automatic recognition and normalization of such entities have received increasing attention in biomedical research communities. Existing methods typically used pipeline models with two independent phases: (i) a disease named entity recognition (DER) system is used to find the boundaries of mentions in text and (ii) a disease named entity normalization (DEN) system is used to connect the mentions recognized to concepts in a controlled vocabulary. The main problems of such models are: (i) there is error propagation from DER to DEN and (ii) DEN is useful for DER, but pipeline models cannot utilize this. Methods: We propose a transition-based model to jointly perform disease named entity recognition and normalization, casting the output construction process into an incremental state transition process, learning sequences of transition actions globally, which correspond to joint structural outputs. Beam search and online structured learning are used, with learning being designed to guide search. Compared with the only existing method for joint DEN and DER, our method allows non-local features to be used, which significantly improves the accuracies. Results: We evaluate our model on two corpora: the BioCreative V Chemical Disease Relation (CDR) corpus and the NCBI disease corpus. Experiments show that our joint framework achieves significantly higher performances compared to competitive pipeline baselines. Our method compares favourably to other state-of-the-art approaches.
Year
DOI
Venue
2017
10.1093/bioinformatics/btx172
BIOINFORMATICS
Field
DocType
Volume
Data mining,Normalization (statistics),Computer science,Speech recognition,Named-entity recognition
Journal
33
Issue
ISSN
Citations 
15
1367-4803
5
PageRank 
References 
Authors
0.43
10
6
Name
Order
Citations
PageRank
Yinxia Lou161.45
Yue Zhang21364114.17
Qian Tao35914.00
Fei Li49739.93
Shufeng Xiong5283.80
Donghong Ji6892120.08