Empirical assessment of feature selection techniques in defect prediction models using web applications. - Citegraph

Paper Info

Title
Empirical assessment of feature selection techniques in defect prediction models using web applications.

Abstract
In order to minimize the over-fitting and related factors that are caused by the high dimensionality of the input data in software defect prediction, the attributes are often optimized using various feature selection techniques. However, the comparative performance of these selection techniques in combination with machine learning algorithms remains largely unexplored using web applications. In this work, we investigate the best possible combination of feature selection technique with machine learning algorithms, with the sample space chosen from open source Apache Click and Rave data sets. Our results are based on 945 defect prediction models derived from parametric, non-parametric and ensemble-based machine learning algorithms, for which the metrics are derived from the various filter and threshold-based ranking techniques. Friedman and Nemenyi post-hoc statistical tests are adopted to identify the performance difference of these models. We find that filter-based feature selection in combination with ensemble-based machine learning algorithms not only poise as the best strategy but also yields a maximum feature set redundancy by 94%, with little or no comprise on the performance index.

Year	DOI	Venue
2019	10.3233/JIFS-18473	JOURNAL OF INTELLIGENT & FUZZY SYSTEMS
Keywords	Field	DocType
Feature selection,feature ranking,machine learning,web application quality	Empirical assessment,Feature selection,Artificial intelligence,Predictive modelling,Web application,Machine learning,Mathematics	Journal
Volume	Issue	ISSN
36	6	1064-1246
Citations	PageRank	References
0	0.34	0
Authors
2

Authors (2 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Ruchika Malhotra	1	533	35.12
Anjali Sharma	2	6	3.87

1