Title
Poisoning Attacks on Fair Machine Learning
Abstract
Both fair machine learning and adversarial learning have been extensively studied. However, attacking fair machine learning models has received less attention. In this paper, we present a framework that effectively generates poisoning samples to attack both model accuracy and algorithmic fairness. Our attacking framework can target fair machine learning models trained with a variety of group-based fairness notions such as demographic parity and equalized odds. We develop three online attacks: adversarial sampling, adversarial labeling, and adversarial feature modification. All three attacks effectively and efficiently produce poisoning samples by sampling, labeling, or modifying a fraction of the training data in order to reduce the test accuracy. Our framework enables attackers to flexibly adjust the attack’s focus between prediction accuracy and fairness, and to accurately quantify the impact of each candidate point on both accuracy loss and fairness violation, thus producing effective poisoning samples. Experiments on two real datasets demonstrate the effectiveness and efficiency of our framework.
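The abstract describes scoring candidate poisoning points by their joint impact on accuracy loss and fairness violation. Below is a minimal, hypothetical Python sketch of that idea, not the paper's implementation: it scores each candidate by a lam-weighted combination of the increase in validation error and in the demographic parity gap after the point is added. The logistic-regression victim model, the lam trade-off parameter, and the naive retrain-per-candidate loop are all illustrative assumptions; the paper instead develops efficient online attacks that avoid full retraining.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def demographic_parity_gap(model, X, group):
    """Absolute gap in positive-prediction rates between two protected
    groups (assumes a binary group attribute coded 0/1)."""
    pred = model.predict(X)
    return abs(pred[group == 0].mean() - pred[group == 1].mean())

def score_candidates(X_train, y_train, X_cand, y_cand,
                     X_val, y_val, g_val, lam=0.5):
    """Score each candidate poisoning point by the lam-weighted increase
    in validation error and demographic parity gap it induces when
    appended to the training set. Illustrative only: retrains a
    logistic-regression victim model per candidate, which the paper's
    online attacks are designed to avoid."""
    base = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    base_err = 1.0 - base.score(X_val, y_val)
    base_gap = demographic_parity_gap(base, X_val, g_val)

    scores = []
    for x, y in zip(X_cand, y_cand):
        X_aug = np.vstack([X_train, x])      # append candidate features
        y_aug = np.append(y_train, y)        # append candidate label
        model = LogisticRegression(max_iter=1000).fit(X_aug, y_aug)
        err = 1.0 - model.score(X_val, y_val)
        gap = demographic_parity_gap(model, X_val, g_val)
        # Weighted combination of accuracy degradation and fairness violation.
        scores.append(lam * (err - base_err) + (1.0 - lam) * (gap - base_gap))
    return np.array(scores)
```

An attacker would then inject the top-scoring candidates into the training set; other group-based fairness notions such as equalized odds could be swapped in for the gap function.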
Year
2022
DOI
10.1007/978-3-031-00123-9_30
Venue
Database Systems for Advanced Applications
Keywords
Poisoning attacks, Algorithmic fairness, Adversarial machine learning
DocType
Conference
ISSN
0302-9743
Citations
0
PageRank
0.34
References
0
Authors
4
Name           Order   Citations   PageRank
Minh-Hao Van   1       0           0.34
Wei Du         2       0           0.68
Xintao Wu      3       892         76.91
Aidong Lu      4       353         30.18