Title
An improved method to simplify software metric models constructed with incomplete data samples
Abstract
Software metric models are useful in predicting the target software metric(s) for any future software project based on the project's predictor metric(s). Obviously, the construction of such a model makes use of a data sample of such metrics from analogous past projects. However, incomplete data often appear in such data samples. Worse still, the necessity to include a particular continuous predictor metric or a particular category for a certain categorical predictor metric is most likely based on an experience-related intuition that the continuous predictor metric or the category matters to the target metric. However, in the presence of incomplete data, this intuition is traditionally not verifiable “retrospectively” after the model is constructed, leading to redundant continuous predictor metric(s) and/or excessive categorization for categorical predictor metrics. As an improvement of the author's previous work to solve all these problems, this paper proposes a methodology incorporating the k-nearest neighbors (k-NN) multiple imputation method, kernel smoothing, Monte Carlo simulation, and stepwise regression. This paper documents this methodology and one experiment on it.
Year
DOI
Venue
2010
10.1109/FSKD.2010.5569384
FSKD
Keywords
Field
DocType
kernel smoothing,software management,multiple imputation method,software project,project predictor metric,software metric model,categorical predictor metric,stepwise regression,multiple imputation,monte carlo simulation,missing data,redundant continuous predictor metric,software metrics,experience-related intuition,k-nearest neighbor,incomplete data sample,model simplification,k nearest neighbor,predictive models,software quality,data models,software metric
Data mining,Data modeling,Sample (statistics),Computer science,Categorical variable,Software,Artificial intelligence,Software metric,Missing data,Imputation (statistics),Software quality,Machine learning
Conference
Volume
ISBN
Citations 
4
978-1-4244-5931-5
2
PageRank 
References 
Authors
0.36
30
2
Name
Order
Citations
PageRank
Tianfa Xie121.04
W. Eric Wong228456.75