Title
Tumor Information Extraction in Radiology Reports for Hepatocellular Carcinoma Patients.
Abstract
Hepatocellular carcinoma (HCC) is a deadly disease affecting the liver for which there are many available therapies. Targeting treatments towards specific patient groups necessitates defining patients by stage of disease. Criteria for such stagings include information on tumor number, size, and anatomic location, typically only found in narrative clinical text in the electronic medical record (EMR). Natural language processing (NLP) offers an automatic and scale-able means to extract this information, which can further evidence-based research. In this paper, we created a corpus of 101 radiology reports annotated for tumor information. Afterwards we applied machine learning algorithms to extract tumor information. Our inter-annotator partial match agreement scored at 0.93 and 0.90 F1 for entities and relations, respectively. Based on the annotated corpus, our sequential labeling entity extraction achieved 0.87 F1 partial match, and our maximum entropy classification relation extraction achieved scores 0.89 and 0. 74 F1 with gold and system entities, respectively.
Year
Venue
DocType
2016
CRI
Conference
Volume
Citations 
PageRank 
2016
1
0.35
References 
Authors
0
4
Name
Order
Citations
PageRank
Wen-wai Yim123.06
Tyler Denman210.68
Sharon W. Kwan321.71
Meliha Yetisgen-Yildiz432834.25