Title
Multiple-boundary clustering and prioritization to promote neural network retraining
Abstract
ABSTRACTWith the increasing application of deep learning (DL) models in many safety-critical scenarios, effective and efficient DL testing techniques are much in demand to improve the quality of DL models. One of the major challenges is the data gap between the training data to construct the models and the testing data to evaluate them. To bridge the gap, testers aim to collect an effective subset of inputs from the testing contexts, with limited labeling effort, for retraining DL models. To assist the subset selection, we propose Multiple-Boundary Clustering and Prioritization (MCP), a technique to cluster test samples into the boundary areas of multiple boundaries for DL models and specify the priority to select samples evenly from all boundary areas, to make sure enough useful samples for each boundary reconstruction. To evaluate MCP, we conduct an extensive empirical study with three popular DL models and 33 simulated testing contexts. The experiment results show that, compared with state-of-the-art baseline methods, on effectiveness, our approach MCP has a significantly better performance by evaluating the improved quality of retrained DL models; on efficiency, MCP also has the advantages in time costs.
Year
DOI
Venue
2020
10.1145/3324884.3416621
ASE
Keywords
DocType
ISSN
Software testing, Deep learning, Multiple-Boundary, Neural network, Retraining
Conference
1527-1366
ISBN
Citations 
PageRank 
978-1-7281-7281-1
1
0.37
References 
Authors
31
6
Name
Order
Citations
PageRank
Weijun Shen171.14
Yanhui Li214115.04
Lin Chen3496.78
YuanLei Han410.37
Yuming Zhou555924.72
Xu, Baowen62476165.27