Title
Supervised Multi-View Canonical Correlation Analysis (sMVCCA): Integrating Histologic and Proteomic Features for Predicting Recurrent Prostate Cancer
Abstract
In this work, we present a new methodology to facilitate prediction of recurrent prostate cancer (CaP) following radical prostatectomy (RP) via the integration of quantitative image features and protein expression in the excised prostate. Creating a fused predictor from high-dimensional data streams is challenging because the classifier must 1) account for the “curse of dimensionality” problem, which hinders classifier performance when the number of features exceeds the number of patient studies and 2) balance potential mismatches in the number of features across different channels to avoid classifier bias towards channels with more features. Our new data integration methodology, supervised Multi-view Canonical Correlation Analysis (sMVCCA), aims to integrate infinite views of highdimensional data to provide more amenable data representations for disease classification. Additionally, we demonstrate sMVCCA using Spearman's rank correlation which, unlike Pearson's correlation, can account for nonlinear correlations and outliers. Forty CaP patients with pathological Gleason scores 6-8 were considered for this study. 21 of these men revealed biochemical recurrence (BCR) following RP, while 19 did not. For each patient, 189 quantitative histomorphometric attributes and 650 protein expression levels were extracted from the primary tumor nodule. The fused histomorphometric/proteomic representation via sMVCCA combined with a random forest classifier predicted BCR with a mean AUC of 0.74 and a maximum AUC of 0.9286. We found sMVCCA to perform statistically significantly (p <; 0.05) better than comparative state-of-the-art data fusion strategies for predicting BCR. Furthermore, Kaplan-Meier analysis demonstrated improved BCR-free survival prediction for the sMVCCA-fused classifier as compared to histology or proteomic features alone.
Year
DOI
Venue
2015
10.1109/TMI.2014.2355175
IEEE Trans. Med. Imaging
Keywords
Field
DocType
Data fusion, digital pathology, dimensionality reduction, mass spectrometry, prostate cancer, proteomics
Rank correlation,Dimensionality reduction,Pattern recognition,Canonical correlation,Feature (computer vision),Computer science,Correlation,Artificial intelligence,Biochemical recurrence,Classifier (linguistics),Random forest
Journal
Volume
Issue
ISSN
34
1
0278-0062
Citations 
PageRank 
References 
15
0.68
25
Authors
10
Name
Order
Citations
PageRank
George Lee1674.05
Asha Singanamalli2181.78
Haibo Wang3181.76
Michael Feldman451835.49
Stephen R. Master5753.19
Natalie N. C. Shih6150.68
Elaine Spangler7150.68
Timothy Rebbeck8150.68
John Tomaszewski932818.14
Anant Madabhushi101736139.21