Title
Identifying Similar Software Datasets through Fuzzy Inference System
Abstract
Similar software have similar software measurements. Defect data from one software can be used to anticipate defects in a similar software. Although, not many defect datasets are made public in software engineering domain, PROMISE repository is a reasonable collection of software data. This paper presents a two step approach to identify similar software and applies the proposed technique to find similar datasets in PROMISE repository. As step 1, the approach generates associations rules for each dataset to determine dataset's behavior in terms of frequent patterns. As step 2, overlap between the association rules is calculated using Fuzzy Inference Systems (FIS). The FIS generated for the study have been expert-based as well as auto-generated. Similarity between 28 dataset pairs has been found KC2 and PC1 turned out to be most similar datasets with 86% similarity using Mamdani, 92% with Sugeno models. Results from expert-based and auto generated FIS have been comparable.
Year
DOI
Venue
2012
10.1109/FIT.2012.40
Frontiers of Information Technology
Keywords
Field
DocType
similar software measurement,promise repository,similar software datasets,defect datasets,software engineering domain,similar datasets,similar software,software data,fuzzy inference system,dataset pair,defect data,step approach,data handling,data mining,software reliability
Data mining,Fuzzy reasoning,Computer science,Fuzzy inference,Association rule learning,Software,Fuzzy control system,Software quality,Group method of data handling,Fuzzy inference system
Conference
ISBN
Citations 
PageRank 
978-1-4673-4946-8
0
0.34
References 
Authors
10
3
Name
Order
Citations
PageRank
Saba Anwar120.75
Zeeshan A. Rana200.34
Mian Awais35911.53