Abstract | ||
---|---|---|
Source code and metric mining have been used to successfully assist with software quality evaluation. This paper presents a data mining approach which incorporates clustering Java classes, as well as classifying extracted clusters, in order to assess internal software quality. We use Java classes as entities and static metrics as attributes for data mining. We identify outliers and apply K- means clustering in order to establish clusters of classes. Outliers indicate potentially fault prone classes, whilst clusters are examined so that we can establish common characteristics. Subsequently, we apply C4.5 to build classification trees for identifying metrics which determine cluster membership. We evaluate the proposed approach with two well known open source software systems, Jedit and Apache Geronimo. Results have consolidated key findings from previous work and indicated that combining clustering with classification produces better results than stand alone clustering. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1007/978-3-319-07064-3_22 | ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS |
Field | DocType | Volume |
Java classes,Data mining,Cluster (physics),Computer science,Source code,Cyclomatic complexity,Outlier,Artificial intelligence,Conceptual clustering,Software quality,Cluster analysis,Machine learning | Conference | 8445 |
ISSN | Citations | PageRank |
0302-9743 | 1 | 0.34 |
References | Authors | |
13 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Diomidis Papas | 1 | 1 | 0.34 |
Christos Tjortjis | 2 | 173 | 24.40 |