Title
A tree ensemble-based two-stage model for advanced-stage colorectal cancer survival prediction.
Abstract
Classification techniques have widely been applied to cancer survival prediction for predicting survival or death of patients. However, little attention has been paid to patients who are predicted to die. In this work, we consider survival prediction as a two-stage task, where the first stage is to predict whether the outcome is survival or not, and the second stage is to predict the remaining lifespan for patients whose predicted outcome is death. To this end, we propose a two-stage machine learning model to enhance cancer survival prediction. At the first stage, a tree-based imbalanced ensemble classification method is proposed for classification of the survivability of advanced-stage cancer patients. At the second stage, a selective ensemble regression method is proposed for survival time prediction, where a priori knowledge is adopted for feature selection and the mean proportion of error interval is proposed for selecting base learners. Extensive computational studies performed on colorectal cancer data from SEER database demonstrate that the proposed two-stage model can achieve a more accurate prediction compared to the one-stage regression model. The results show that the proposed classification approach can effectively handle the imbalanced survivability data, and the proposed regression method outperforms several state-of-the-art regression models.
Year
DOI
Venue
2019
10.1016/j.ins.2018.09.046
Information Sciences
Keywords
Field
DocType
Colorectal cancer,Survival prediction,Two-stage model,Ensemble learning,Semi-random regression tree
Survivability,Regression,Feature selection,Regression analysis,A priori and a posteriori,Artificial intelligence,Colorectal cancer,Cancer,Mathematics,Machine learning
Journal
Volume
ISSN
Citations 
474
0020-0255
3
PageRank 
References 
Authors
0.38
19
6
Name
Order
Citations
PageRank
Yuyan Wang1156.55
Dujuan Wang29114.07
Xin Ye351.41
Yan-zhang Wang45919.56
Yunqiang Yin568643.62
Yaochu Jin66457330.45