Abstract | ||
---|---|---|
A discriminatory dataset refers to a dataset with undesirable correlation between sensitive attributes and the class label, which often leads to biased decision making in data analytics processes. This paper investigates how to build discrimination-aware models even when the available training set is intrinsically discriminating based on some sensitive attributes, such as race, gender or personal status. We propose a new classification method called Discrimination-Aware Association Rule classifier (DAAR), which integrates a new discrimination-aware measure and an association rule mining algorithm. We evaluate the performance of DAAR on three real datasets from different domains and compare it with two non-discrimination-aware classifiers (a standard association rule classification algorithm and the state-of-the-art association rule algorithm SPARCCC), and also with a recently proposed discrimination-aware decision tree method. The results show that DAAR is able to effectively filter out the discriminatory rules and decrease the discrimination on all datasets with insignificant impact on the predictive accuracy. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/978-3-319-22729-0_9 | Lecture Notes in Computer Science |
Keywords | Field | DocType |
Discrimination-aware data mining,Association rule classification,Unbiased decision making | Training set,Decision tree,Data mining,Data analysis,Computer science,Association rule learning,Personal status,Correlation,Artificial intelligence,Classifier (linguistics),Machine learning | Conference |
Volume | ISSN | Citations |
9263 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 9 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ling Luo | 1 | 8 | 3.12 |
Wei Liu | 2 | 468 | 37.36 |
Irena Koprinska | 3 | 783 | 64.00 |
Fang Chen | 4 | 156 | 49.84 |