Title
Effective Use of Bidirectional Language Modeling for Medical Named Entity Recognition.
Abstract
Biomedical named entity recognition (NER) is a fundamental task in text mining of medical documents and has a lot of applications. Existing approaches for NER require manual feature engineering in order to represent words and its corresponding contextual information. Deep learning based approaches have been gaining increasing attention in recent years as their weight parameters can be learned end-to-end without the need for hand-engineered features. These approaches rely on high-quality labeled data which is expensive to obtain. To address this issue, we investigate how to use widely available unlabeled text data to improve the performance of NER models. Specifically, we train a bidirectional language model (Bi-LM) on unlabeled data and transfer its weights to a NER model with the same architecture as the Bi-LM, which results in a better parameter initialization of the NER model. We evaluate our approach on three datasets for disease NER and show that it leads to a remarkable improvement in F1 score as compared to the model with random parameter initialization. We also show that Bi-LM weight transfer leads to faster model training. In addition, our model requires fewer training examples to achieve a particular F1 score.
Year
Venue
Field
2017
arXiv: Computation and Language
F1 score,Medical documents,Text mining,Computer science,Feature engineering,Natural language processing,Artificial intelligence,Deep learning,Initialization,Named-entity recognition,Language model
DocType
Volume
Citations 
Journal
abs/1711.07908
2
PageRank 
References 
Authors
0.36
19
3
Name
Order
Citations
PageRank
devendra singh sachan1334.51
Pengtao Xie233922.63
Bo Xing37332471.43