An improved method to simplify software metric models constructed with incomplete data samples - Citegraph

Paper Info

Title
An improved method to simplify software metric models constructed with incomplete data samples

Abstract
Software metric models are useful in predicting the target software metric(s) for any future software project based on the project's predictor metric(s). Obviously, the construction of such a model makes use of a data sample of such metrics from analogous past projects. However, incomplete data often appear in such data samples. Worse still, the necessity to include a particular continuous predictor metric or a particular category for a certain categorical predictor metric is most likely based on an experience-related intuition that the continuous predictor metric or the category matters to the target metric. However, in the presence of incomplete data, this intuition is traditionally not verifiable “retrospectively” after the model is constructed, leading to redundant continuous predictor metric(s) and/or excessive categorization for categorical predictor metrics. As an improvement of the author's previous work to solve all these problems, this paper proposes a methodology incorporating the k-nearest neighbors (k-NN) multiple imputation method, kernel smoothing, Monte Carlo simulation, and stepwise regression. This paper documents this methodology and one experiment on it.

Year	DOI	Venue
2010	10.1109/FSKD.2010.5569384	FSKD
Keywords	Field	DocType
kernel smoothing,software management,multiple imputation method,software project,project predictor metric,software metric model,categorical predictor metric,stepwise regression,multiple imputation,monte carlo simulation,missing data,redundant continuous predictor metric,software metrics,experience-related intuition,k-nearest neighbor,incomplete data sample,model simplification,k nearest neighbor,predictive models,software quality,data models,software metric	Data mining,Data modeling,Sample (statistics),Computer science,Categorical variable,Software,Artificial intelligence,Software metric,Missing data,Imputation (statistics),Software quality,Machine learning	Conference
Volume	ISBN	Citations
4	978-1-4244-5931-5	2
PageRank	References	Authors
0.36	30	2

Authors (2 rows)

Cited by (2 rows)

References (30 rows)

Name	Order	Citations	PageRank
Tianfa Xie	1	2	1.04
W. Eric Wong	2	284	56.75

1