Title
Using The Machine Learning Approach To Predict Patient Survival From High-Dimensional Survival Data
Abstract
Survival analysis with high-dimensional data deals with the prediction of patient survival based on their gene expression data and clinical data. A crucial task for the accuracy of survival analysis in this context is to select the features highly correlated with the patient's survival time. Since the information about class labels is hidden, existing feature selection methods in machine learning are not applicable. In contrast to classical statistical methods which address this issue with the Cox score, we propose to tackle this problem by discretizing the survival time of patients into a suitable number of subgroups via silhouettes clustering validity. To cope with patients' censoring, we use "k-nearest neighbor" based on clinical parameters. Feature selection is then accomplished using Fast Correlation-Based Filtering approach from machine learning community. The effectiveness and efficiency of the proposed method are demonstrated through comparisons with classical statistical methods on real-world datasets and simulation datasets.
Year
Venue
Keywords
2016
2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)
Survival prediction, machine learning, statistical method, high-dimensional survival data
Field
DocType
ISSN
Data mining,Survival data,Feature selection,Computer science,Filter (signal processing),Correlation,Artificial intelligence,Bioinformatics,Cluster analysis,Survival analysis,Censoring (statistics),Machine learning
Conference
2156-1125
Citations 
PageRank 
References 
0
0.34
4
Authors
3
Name
Order
Citations
PageRank
Wenbin Zhang100.34
Jian Tang2526148.30
Nuo Wang300.34