Title
CLEAR: Clean-up Sample-Targeted Backdoor in Neural Networks
Abstract
Data poisoning attacks have raised serious concerns about the safety of deep neural networks, since they can implant a neural backdoor that misclassifies certain inputs crafted by an attacker. In particular, the sample-targeted backdoor attack is a new challenge: it targets one or a few specific samples, called target samples, to misclassify them into a target class. Because no trigger is planted in the backdoored model, existing backdoor detection schemes, which depend on reverse-engineering the trigger or strong features of the trigger, fail to detect the sample-targeted backdoor. In this paper, we propose a novel scheme to detect and mitigate sample-targeted backdoor attacks. We discover and demonstrate a unique property of the sample-targeted backdoor: it forces a boundary change such that small "pockets" are formed around the target sample. Based on this observation, we propose a novel defense mechanism that pinpoints a malicious pocket by "wrapping" it into a tight convex hull in the feature space. We design an effective algorithm to search for such a convex hull and remove the backdoor by fine-tuning the model on the identified malicious samples, with their labels corrected according to the convex hull. Experiments show that the proposed approach is highly efficient in detecting and mitigating a wide range of sample-targeted backdoor attacks.
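The abstract describes the convex-hull defense only at a high level; the paper's actual search algorithm is not given in this record. The following is a minimal, hypothetical Python sketch of the core idea: probe the feature-space neighborhood of a suspect sample by sampling random convex combinations of its nearest clean neighbors and checking whether the classifier's predictions flip inside that local hull (a "pocket"). All names here (pocket_score, classify, features, k, n_probes) are illustrative assumptions, not the authors' implementation.

    # Hypothetical sketch of pocket detection via a local convex hull in
    # feature space. Not the paper's algorithm; an illustration of the idea.
    import numpy as np

    def pocket_score(features, labels, suspect_feat, classify,
                     k=10, n_probes=200, rng=None):
        """Fraction of probes inside the local convex hull whose predicted
        label disagrees with the majority label of the k nearest clean
        neighbors. A high score suggests a backdoor "pocket"."""
        rng = np.random.default_rng(rng)
        # k nearest clean neighbors of the suspect sample in feature space
        dists = np.linalg.norm(features - suspect_feat, axis=1)
        nn = np.argsort(dists)[:k]
        hull_pts = features[nn]                      # vertices of the local hull
        majority = np.bincount(labels[nn]).argmax()  # expected label in this region
        # random convex combinations: Dirichlet weights stay on the simplex,
        # so every probe lies inside the convex hull of the neighbors
        w = rng.dirichlet(np.ones(k), size=n_probes)
        probes = w @ hull_pts
        preds = classify(probes)                     # classify feature-space points
        return float(np.mean(preds != majority)), majority

As a usage sketch: with features taken from the penultimate layer and classify wrapping the model's final layer, a high pocket_score for a suspect sample would flag a candidate pocket; the flagged probes could then be assigned the majority label for fine-tuning, in the spirit of the mitigation step described in the abstract.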
Year: 2021
DOI: 10.1109/ICCV48922.2021.01614
Venue: ICCV
DocType: Conference
Citations: 0
PageRank: 0.34
References: 0
Authors: 5
Name             Order  Citations  PageRank
Liuwan Zhu       1      1          1.36
Rui Ning         2      1          2.03
Chunsheng Xin    3      12         5.61
Chonggang Wang   4      0          0.34
Hongyi Wu        5      848        76.90