Multiple-boundary clustering and prioritization to promote neural network retraining - Citegraph

Paper Info

Title
Multiple-boundary clustering and prioritization to promote neural network retraining

Abstract
ABSTRACTWith the increasing application of deep learning (DL) models in many safety-critical scenarios, effective and efficient DL testing techniques are much in demand to improve the quality of DL models. One of the major challenges is the data gap between the training data to construct the models and the testing data to evaluate them. To bridge the gap, testers aim to collect an effective subset of inputs from the testing contexts, with limited labeling effort, for retraining DL models. To assist the subset selection, we propose Multiple-Boundary Clustering and Prioritization (MCP), a technique to cluster test samples into the boundary areas of multiple boundaries for DL models and specify the priority to select samples evenly from all boundary areas, to make sure enough useful samples for each boundary reconstruction. To evaluate MCP, we conduct an extensive empirical study with three popular DL models and 33 simulated testing contexts. The experiment results show that, compared with state-of-the-art baseline methods, on effectiveness, our approach MCP has a significantly better performance by evaluating the improved quality of retrained DL models; on efficiency, MCP also has the advantages in time costs.

Year	DOI	Venue
2020	10.1145/3324884.3416621	ASE
Keywords	DocType	ISSN
Software testing, Deep learning, Multiple-Boundary, Neural network, Retraining	Conference	1527-1366
ISBN	Citations	PageRank
978-1-7281-7281-1	1	0.37
References	Authors
31	6

Authors (6 rows)

Cited by (1 rows)

References (31 rows)

Name	Order	Citations	PageRank
Weijun Shen	1	7	1.14
Yanhui Li	2	141	15.04
Lin Chen	3	49	6.78
YuanLei Han	4	1	0.37
Yuming Zhou	5	559	24.72
Xu, Baowen	6	2476	165.27

1