A Survey on Neural Trojans - Citegraph

Paper Info

Title
A Survey on Neural Trojans

Abstract
Neural networks have become increasingly prevalent in many real-world applications including security critical ones. Due to the high hardware requirement and time consumption to train high-performance neural network models, users often outsource training to a machine-learning-as-a-service (MLaaS) provider. This puts the integrity of the trained model at risk. In 2017, Liu et al. found that, by mixing the training data with a few malicious samples of a certain trigger pattern, hidden functionality can be embedded in the trained network which can be evoked by the trigger pattern [33]. We refer to this kind of hidden malicious functionality as neural Trojans. In this paper, we survey a myriad of neural Trojan attack and defense techniques that have been proposed over the last few years. In a neural Trojan insertion attack, the attacker can be the MLaaS provider itself or a third party capable of adding or tampering with training data. In most research on attacks, the attacker selects the Trojan's functionality and a set of input patterns that will trigger the Trojan. Training data poisoning is the most common way to make the neural network acquire the Trojan functionality. Trojan embedding methods that modify the training algorithm or directly interfere with the neural network's execution at the binary level have also been studied. Defense techniques include detecting neural Trojans in the model and/or Trojan trigger patterns, erasing the Trojan's functionality from the neural network model, and bypassing the Trojan. It was also shown that carefully crafted neural Trojans can be used to mitigate other types of attacks. We systematize the above attack and defense approaches in this paper.

Year	DOI	Venue
2020	10.1109/ISQED48828.2020.9137011	2020 21st International Symposium on Quality Electronic Design (ISQED)
DocType	Volume	ISSN
Journal	2020	1948-3287
ISBN	Citations	PageRank
978-1-7281-4207-4	2	0.39
References	Authors
0	7

Authors (7 rows)

Cited by (2 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yun-Tao Liu	1	29	7.42
Ankit Mondal	2	2	1.40
Abhishek Chakraborty	3	2	0.39
Michael Zuzak	4	16	2.37
Nina Jacobsen	5	2	0.39
Daniel Xing	6	2	0.39
Ankur Srivastava	7	902	79.64

1