Title
Optimization of breast mass classification using sequential forward floating selection (SFFS) and a support vector machine (SVM) model.
Abstract
Improving radiologists' performance in classification between malignant and benign breast lesions is important to increase cancer detection sensitivity and reduce false-positive recalls. For this purpose, developing computer-aided diagnosis schemes has been attracting research interest in recent years. In this study, we investigated a new feature selection method for the task of breast mass classification. We initially computed 181 image features based on mass shape, spiculation, contrast, presence of fat or calcifications, texture, isodensity, and other morphological features. From this large image feature pool, we used a sequential forward floating selection (SFFS)-based feature selection method to select relevant features and analyzed their performance using a support vector machine (SVM) model trained for the classification task. On a database of 600 benign and 600 malignant mass regions of interest, we performed the study using a tenfold cross-validation method. Feature selection and optimization of the SVM parameters were conducted on the training subsets only. The area under the receiver operating characteristic curve [Formula: see text] was obtained for the classification task. The results also showed that the most frequently selected features by the SFFS-based algorithm in tenfold iterations were those related to mass shape, isodensity, and presence of fat, which are consistent with the image features frequently used by radiologists in the clinical environment for mass classification. The study also indicated that accurately computing mass spiculation features from the projection mammograms was difficult, and failed to perform well for the mass classification task due to tissue overlap within the benign mass regions. In conclusion, this comprehensive feature analysis study provided new and valuable information for optimizing computerized mass classification schemes that may have potential to be useful as a "second reader" in future clinical practice.
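The floating search named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `sffs` and `toy_score` are hypothetical, and the criterion is a hand-made integer score rather than SVM performance. In a setting like the one described, the criterion would presumably be a classifier performance estimate computed on the training subset only. The sketch shows how a conditional backward step can discard a feature that a purely forward search would keep.

```python
from typing import Callable, Dict, FrozenSet, Tuple


def sffs(n_features: int,
         score: Callable[[FrozenSet[int]], float],
         k_max: int) -> Dict[int, Tuple[float, FrozenSet[int]]]:
    """Simplified sequential forward floating selection.

    Returns, for each subset size visited, the best (score, subset) found.
    """
    selected: set = set()
    best: Dict[int, Tuple[float, FrozenSet[int]]] = {}
    while len(selected) < k_max:
        # Forward step: add the single feature that helps the most.
        candidates = [f for f in range(n_features) if f not in selected]
        if not candidates:
            break
        add = max(candidates, key=lambda f: score(frozenset(selected | {f})))
        selected.add(add)
        s = score(frozenset(selected))
        size = len(selected)
        if size not in best or s > best[size][0]:
            best[size] = (s, frozenset(selected))
        # Floating step: drop features while that strictly beats the
        # best subset recorded so far at the smaller size.
        while len(selected) > 2:
            drop = max(sorted(selected),
                       key=lambda f: score(frozenset(selected - {f})))
            s_drop = score(frozenset(selected - {drop}))
            if s_drop > best[len(selected) - 1][0]:
                selected.remove(drop)
                best[len(selected)] = (s_drop, frozenset(selected))
            else:
                break
    return best


def toy_score(subset: FrozenSet[int]) -> int:
    """Hypothetical criterion: features 1 and 2 are strong together,
    while feature 0 is good alone but conflicts with that pair."""
    weights = {0: 50, 1: 40, 2: 10, 3: 5}
    total = sum(weights[f] for f in subset)
    if {1, 2} <= subset:
        total += 60
        if 0 in subset:
            total -= 70
    return total


best = sffs(n_features=4, score=toy_score, k_max=4)
top_score, top_subset = max(best.values(), key=lambda item: item[0])
print(sorted(top_subset), top_score)
```

With this toy criterion, a plain forward search would keep feature 0, which it picks first, whereas the floating step removes it once features 1 and 2 are both present. Because backward steps are accepted only when they strictly improve the best score recorded for the smaller subset size, the search cannot cycle.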
Year: 2014
DOI: 10.1007/s11548-014-0992-1
Venue: Int. J. Computer Assisted Radiology and Surgery
Keywords: Computer-aided diagnosis of mammograms, Breast cancer, Pattern classification, Feature selection
Field: Data mining, Text mining, Breast cancer, Feature selection, Mass classification, Computer science, Support vector machine, Cancer detection
DocType: Journal
Volume: 9
Issue: 6
ISSN: 1861-6429
Citations: 12
PageRank: 0.54
References: 27
Authors: 3
Name          Order  Citations  PageRank
Maxine Tan    1      44         10.61
Jiantao Pu    2      277        23.12
Bin Zheng     3      26         2.01