Title
Variable importance in nonlinear kernels (VINK): classification of digitized histopathology.
Abstract
Quantitative histomorphometry is the process of modeling appearance of disease morphology on digitized histopathology images via image-based features (e.g., texture, graphs). Due to the curse of dimensionality, building classifiers with large numbers of features requires feature selection (which may require a large training set) or dimensionality reduction (DR). DR methods map the original high-dimensional features in terms of eigenvectors and eigenvalues, which limits the potential for feature transparency or interpretability. Although methods exist for variable selection and ranking on embeddings obtained via linear DR schemes (e. g., principal components analysis (PCA)), similar methods do not yet exist for nonlinear DR (NLDR) methods. In this work we present a simple yet elegant method for approximating the mapping between the data in the original feature space and the transformed data in the kernel PCA (KPCA) embedding space; this mapping provides the basis for quantification of variable importance in nonlinear kernels (VINK). We show how VINK can be implemented in conjunction with the popular Isomap and Laplacian eigenmap algorithms. VINK is evaluated in the contexts of three different problems in digital pathology: (1) predicting five year PSA failure following radical prostatectomy, (2) predicting Oncotype DX recurrence risk scores for ER+ breast cancers, and (3) distinguishing good and poor outcome p16+ oropharyngeal tumors. We demonstrate that subsets of features identified by VINK provide similar or better classification or regression performance compared to the original high dimensional feature sets.
Year
DOI
Venue
2013
10.1007/978-3-642-40763-5_30
Lecture Notes in Computer Science
Field
DocType
Volume
Interpretability,Feature vector,Dimensionality reduction,Feature selection,Pattern recognition,Computer science,Kernel principal component analysis,Curse of dimensionality,Artificial intelligence,Principal component analysis,Isomap
Conference
8150
Issue
ISSN
Citations 
Pt 2
0302-9743
2
PageRank 
References 
Authors
0.38
8
5
Name
Order
Citations
PageRank
Shoshana Ginsburg1102.40
Sahirzeeshan Ali2936.47
George Lee320.38
Ajay Basavanhally419011.19
Anant Madabhushi51736139.21