Title
Adversarial Risk and the Dangers of Evaluating Against Weak Attacks
Abstract
This paper investigates recently proposed approaches for defending against adversarial examples and for evaluating adversarial robustness. The existence of adversarial examples in trained neural networks reflects the fact that expected risk alone does not capture a model's performance against worst-case inputs. We motivate the use of adversarial risk as an objective, although it cannot easily be computed exactly. We then frame commonly used attacks and evaluation metrics as defining a tractable surrogate objective for the true adversarial risk. This suggests that models may become obscured to adversaries by optimizing this surrogate rather than the true adversarial risk. We demonstrate that this is a significant problem in practice by repurposing gradient-free optimization techniques into adversarial attacks, which we use to drive the accuracy of several recently proposed defenses to near zero. Our hope is that our formulations and results will help researchers to develop more powerful defenses.
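In the notation usually used for this line of work (assumed here, since this record does not define it: D is the data distribution, N(x) the set of allowed perturbations of x, L the loss, and A a concrete attack), the adversarial risk and its attack-defined surrogate can be written as

    R_{\mathrm{adv}}(\theta) = \mathbb{E}_{(x,y)\sim\mathcal{D}}\Big[ \max_{x' \in N(x)} L(\theta, x', y) \Big]

    \hat{R}_{A}(\theta) = \mathbb{E}_{(x,y)\sim\mathcal{D}}\big[ L(\theta, A(x), y) \big] \le R_{\mathrm{adv}}(\theta)

Since A(x) reaches only one point of N(x), the surrogate can only underestimate the true risk, which is why evaluating against a weak attack can make a defense look far more robust than it is.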
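The gradient-free attacks mentioned in the abstract can be built from simultaneous-perturbation (SPSA-style) finite-difference estimates, which need only black-box access to a scalar loss. A minimal sketch in Python, assuming loss_fn returns a scalar that is larger for more adversarial inputs and that inputs live in [0, 1]; the function name and hyperparameters are illustrative, not the authors' implementation:

    import numpy as np

    def spsa_attack(loss_fn, x, epsilon=0.05, steps=100,
                    delta=0.01, lr=0.01, batch=8):
        # Gradient-free L-infinity attack: ascend an SPSA estimate of
        # the gradient of loss_fn, projecting back into the epsilon-ball.
        x_adv = x.copy()
        for _ in range(steps):
            grad_est = np.zeros_like(x)
            for _ in range(batch):
                # Random Rademacher direction (+1/-1 per coordinate).
                v = np.random.choice([-1.0, 1.0], size=x.shape)
                # Two-sided finite difference along v; with Rademacher
                # directions, g * v approximates the gradient in
                # expectation (bias of order delta^2).
                g = (loss_fn(x_adv + delta * v)
                     - loss_fn(x_adv - delta * v)) / (2.0 * delta)
                grad_est += g * v
            grad_est /= batch
            # Signed ascent step, then project into the allowed set.
            x_adv = x_adv + lr * np.sign(grad_est)
            x_adv = np.clip(x_adv, x - epsilon, x + epsilon)
            x_adv = np.clip(x_adv, 0.0, 1.0)  # assumes inputs in [0, 1]
        return x_adv

Because the estimator touches the model only through loss_fn, it applies even when gradients are masked or the model contains non-differentiable components, which is exactly the failure mode of defenses evaluated only against weak gradient-based attacks.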
Year
2018
Venue
ICML
DocType
Conference
Volume
abs/1802.05666
Citations
18
PageRank
0.74
References
30
Authors
4
Name                 Order  Citations  PageRank
Jonathan Uesato      1      85         6.60
Brendan O'Donoghue   2      172        10.19
Aäron Van Den Oord   3      1585       64.43
Pushmeet Kohli       4      7398       332.84