Label-Only Membership Inference Attacks - Citegraph

Paper Info

Title
Label-Only Membership Inference Attacks

Abstract
Membership inference is one of the simplest privacy threats faced by machine learning models that are trained on private sensitive data. In this attack, an adversary infers whether a particular point was used to train the model, or not, by observing the model's predictions. Whereas current attack methods all require access to the model's predicted confidence score, we introduce a labelonly attack that instead evaluates the robustness of the model's predicted (hard) labels under perturbations of the input, to infer membership. Our label-only attack is not only as-effective as attacks requiring access to confidence scores, it also demonstrates that a class of defenses against membership inference, which we call "confidence masking" because they obfuscate the confidence scores to thwart attacks, are insufficient to prevent the leakage of private information. Our experiments show that training with differential privacy or strong l(2) regularization are the only current defenses that meaningfully decrease leakage of private information, even for points that are outliers of the training distribution.

Year	Venue	DocType
2021	INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139	Conference
Volume	ISSN	Citations
139	2640-3498	0
PageRank	References	Authors
0.34	0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Christopher A. Choquette Choo	1	0	0.34
Florian Tramèr	2	463	26.53
Nicholas Carlini	3	1599	63.23
Nicolas Papernot	4	0	1.69

1