Title
A Pattern-Based Method for Medical Entity Recognition From Chinese Diagnostic Imaging Text.
Abstract
The identification of medical entities and relations from electronic medical records is a fundamental research issue for medical informatics. However, the task of extracting valuable knowledge from these records is challenging due to its high complexity. The accurate identification of entity and relation is still an open research problem in medical information extraction. A pattern-based method for extracting certain tumor-related entities and attributes from Chinese unstructured diagnostic imaging text is proposed. This method is a composition of three steps. Firstly, an algorithm based on keyword matching is designed to obtain the primary sites of tumors. Then a set of regular expressions is applied to identify primary tumor size information. Finally, a set of rules is defined to acquire metastatic sites of tumors. Our method achieves a recall of 0.697, a precision of 0.825 and an F1 score of 0.755 using an overall weighted metric. For each of the extraction tasks, the F1 scores are 0.784, 0.822 and 0.740. The method proves to be stable and robust with different amounts of testing data. It achieves a comparatively high performance in the CHIP 2018 open challenge, demonstrating its effectiveness in extracting tumor-related entities from Chinese diagnostic imaging text.
Year
DOI
Venue
2019
10.3389/frai.2019.00001
Frontiers Artif. Intell.
Keywords
DocType
Volume
clinical text,information extraction,medical named entity recognition,natural language processing,pattern-based strategy
Journal
2
ISSN
Citations 
PageRank 
2624-8212
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Zihong Liang100.34
Junjie Chen200.34
Zhaopeng Xu300.34
Yuyang Chen435.78
Tianyong Hao55413.89