Title | ||
---|---|---|
An improved method to simplify software metric models constructed with incomplete data samples |
Abstract | ||
---|---|---|
Software metric models are useful in predicting the target software metric(s) for any future software project based on the project's predictor metric(s). Obviously, the construction of such a model makes use of a data sample of such metrics from analogous past projects. However, incomplete data often appear in such data samples. Worse still, the necessity to include a particular continuous predictor metric or a particular category for a certain categorical predictor metric is most likely based on an experience-related intuition that the continuous predictor metric or the category matters to the target metric. However, in the presence of incomplete data, this intuition is traditionally not verifiable “retrospectively” after the model is constructed, leading to redundant continuous predictor metric(s) and/or excessive categorization for categorical predictor metrics. As an improvement of the author's previous work to solve all these problems, this paper proposes a methodology incorporating the k-nearest neighbors (k-NN) multiple imputation method, kernel smoothing, Monte Carlo simulation, and stepwise regression. This paper documents this methodology and one experiment on it. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/FSKD.2010.5569384 | FSKD |
Keywords | Field | DocType |
kernel smoothing,software management,multiple imputation method,software project,project predictor metric,software metric model,categorical predictor metric,stepwise regression,multiple imputation,monte carlo simulation,missing data,redundant continuous predictor metric,software metrics,experience-related intuition,k-nearest neighbor,incomplete data sample,model simplification,k nearest neighbor,predictive models,software quality,data models,software metric | Data mining,Data modeling,Sample (statistics),Computer science,Categorical variable,Software,Artificial intelligence,Software metric,Missing data,Imputation (statistics),Software quality,Machine learning | Conference |
Volume | ISBN | Citations |
4 | 978-1-4244-5931-5 | 2 |
PageRank | References | Authors |
0.36 | 30 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tianfa Xie | 1 | 2 | 1.04 |
W. Eric Wong | 2 | 284 | 56.75 |