Title
The Right to Be Forgotten: Towards Machine Learning on Perturbed Knowledge Bases.
Abstract
Today's increasingly complex information infrastructures represent the basis of any data-driven industries which are rapidly becoming the 21st century's economic backbone. The sensitivity of those infrastructures to disturbances in their knowledge bases is therefore of crucial interest for companies, organizations, customers and regulating bodies. This holds true with respect to the direct provisioning of such information in crucial applications like clinical settings or the energy industry, but also when considering additional insights, predictions and personalized services that are enabled by the automatic processing of those data. In the light of new EU Data Protection regulations applying from 2018 onwards which give customers the right to have their data deleted on request, information processing bodies will have to react to these changing jurisdictional (and therefore economic) conditions. Their choices include a re-design of their data infrastructure as well as preventive actions like anonymization of databases per default. Therefore, insights into the effects of perturbed/anonymized knowledge bases on the quality of machine learning results are a crucial basis for successfully facing those future challenges. In this paper we introduce a series of experiments we conducted on applying four different classifiers to an established dataset, as well as several distorted versions of it and present our initial results.
Year
DOI
Venue
2016
10.1007/978-3-319-45507-5_17
Lecture Notes in Computer Science
Keywords
Field
DocType
Machine learning,Knowledge bases,Right to be forgotten,Perturbation,Anonymization,k-anonymity,SaNGreeA,Information loss,Structural loss,Cost weighing vector,Interactive machine learning
Data science,Data mining,Computer security,Computer science,Data Protection Directive,Artificial intelligence,Automatic processing,Information loss,Information processing,k-anonymity,Provisioning,Right to be forgotten,Health informatics,Machine learning
Conference
Volume
ISSN
Citations 
9817
0302-9743
5
PageRank 
References 
Authors
0.46
10
4
Name
Order
Citations
PageRank
Bernd Malle1223.17
Peter Kieseberg218729.39
Edgar Weippl3856105.02
Andreas Holzinger42886253.75