Title
Clinical Multi-Label Free Text Classification By Exploiting Disease Label Relation
Abstract
Clinical data describing a patient's health status can be multi-labelled. For example, a clinical record describing patient suffering from cough and fever should be tagged with both two disease labels. These co-occurred labels often have interrelation which can be exploited to improve disease classifications. In this work, we treat the categorization of free clinical text as a multi-label learning problem. However, we discover that some commonly used multi-label learning methods might suffer from some severe side effects in exploiting complicated disease label relation, such as over-exploitation of label relation and error-propagation in label prediction. Based on these findings, we propose a novel multi-label learning algorithm called Ensemble of Sampled Classifier Chains (ESCC) to improve clinical text data classification. ESCC automatically learns to select relevant disease information that is helpful to improve classification performance when exploiting possible disease relation. In our conducted experiments, ESCC shows strong advantages over other state-of-the-art multi-label algorithms on medical text data with significant improvement in performance. The proposed algorithm is promising in mining knowledge from a wide range of multi-label medical text data.
Year
DOI
Venue
2013
10.1109/BIBM.2013.6732508
2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)
Keywords
Field
DocType
clinical text classification, multi-label learning, disease relation learning
Categorization,Classifier chains,Disease,Text mining,Computer science,Multi label learning,Prediction algorithms,Artificial intelligence,Data classification,Machine learning
Conference
Volume
Issue
ISSN
null
null
2156-1125
Citations 
PageRank 
References 
4
0.44
8
Authors
4
Name
Order
Citations
PageRank
Rui-Wei Zhao1121.99
Guo-Zheng Li236842.62
Jia-Ming Liu3153.14
Xiao Wang4143.33