Title
Automated Functional Dependency Detection Between Test Cases Using Doc2Vec and Clustering
Abstract
Knowing about dependencies and similarities between test cases is beneficial for prioritizing them for cost-effective test execution. This holds especially true for the time-consuming manual execution of integration test cases written in natural language. Test case dependencies are typically derived from requirements and design artifacts. However, such artifacts are not always available, and the derivation process can be very time-consuming. In this paper, we propose, apply and evaluate a novel approach that derives test cases' similarities and functional dependencies directly from the test specification documents written in natural language, without requiring any other data source. Our approach uses an implementation of the Doc2Vec algorithm to detect text-semantic similarities between test cases and then groups them using two clustering algorithms, HDBSCAN and FCM. The correlation between test case text-semantic similarities and their functional dependencies is evaluated in the context of an on-board train control system from Bombardier Transportation AB in Sweden. For this system, the dependencies between the test cases were previously derived and are compared to the results of our approach. The results show that, of the two evaluated clustering algorithms, HDBSCAN performs better than both FCM and a dummy classifier. The classification methods' results are of reasonable quality and especially useful from an industrial point of view. Finally, applying random undersampling to correct the imbalanced data distribution results in an F1 score of up to 75% when using the HDBSCAN clustering algorithm.
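The evaluation idea described in the abstract can be illustrated with a minimal sketch that uses only the Python standard library. It is not the authors' implementation: it assumes that two test cases are predicted to be dependent when they share a cluster, it skips the Doc2Vec and clustering steps by starting from given cluster labels, and all function names, test case identifiers and labels are hypothetical toy values.

```python
import random
from itertools import combinations


def pairs_from_clusters(labels):
    """Predict a pair as dependent when both test cases share a cluster.

    labels maps test case id -> cluster label. The label -1 marks noise
    (the convention HDBSCAN uses); noise points are never paired.
    """
    predicted = set()
    for a, b in combinations(sorted(labels), 2):
        if labels[a] != -1 and labels[a] == labels[b]:
            predicted.add((a, b))
    return predicted


def f1_score(y_true, y_pred):
    """F1 score (harmonic mean of precision and recall) for boolean labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def random_undersample(samples, seed=0):
    """Randomly drop majority-class samples until both classes are equal in size.

    samples is a list of (item, is_positive) tuples.
    """
    rng = random.Random(seed)
    pos = [s for s in samples if s[1]]
    neg = [s for s in samples if not s[1]]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    return minority + rng.sample(majority, len(minority))


# Hypothetical toy data: cluster labels and known dependent pairs.
labels = {"TC1": 0, "TC2": 0, "TC3": 1, "TC4": 1, "TC5": -1}
true_pairs = {("TC1", "TC2"), ("TC3", "TC5")}

all_pairs = list(combinations(sorted(labels), 2))
predicted_pairs = pairs_from_clusters(labels)

y_true = [pair in true_pairs for pair in all_pairs]
y_pred = [pair in predicted_pairs for pair in all_pairs]
f1_full = f1_score(y_true, y_pred)

# Balance dependent and independent pairs before scoring.
balanced = random_undersample(list(zip(all_pairs, y_true)))
f1_balanced = f1_score(
    [is_dep for _, is_dep in balanced],
    [pair in predicted_pairs for pair, _ in balanced],
)
print(f1_full, f1_balanced)
```

In the toy data, dependent pairs are much rarer than independent ones, which is the kind of imbalance that undersampling is meant to correct. The balanced score varies with the random seed because different independent pairs are retained.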
Year
2019
DOI
10.1109/AITest.2019.00-13
Venue
2019 IEEE International Conference On Artificial Intelligence Testing (AITest)
Keywords
Software Testing, Paragraph Vectors, Test Case Dependency, Clustering, Doc2Vec, HDBSCAN, FCM
Field
Data mining, F1 score, Integration testing, Computer science, Undersampling, Functional dependency, Natural language, Test case, Classifier (linguistics), Cluster analysis
DocType
Conference
ISBN
978-1-7281-0493-5
Citations
1
PageRank
0.36
References
0
Authors
5
Name | Order | Citations | PageRank
Sahar Tahvili | 1 | 11 | 3.31
Leo Hatvani | 2 | 7 | 3.22
Michael Felderer | 3 | 538 | 78.87
Wasif Afzal | 4 | 388 | 30.92
Markus Bohlin | 5 | 77 | 14.24