Comparison-Based Inverse Classification for Interpretability in Machine Learning. - Citegraph

Paper Info

Title
Comparison-Based Inverse Classification for Interpretability in Machine Learning.

Abstract
In the context of post-hoc interpretability, this paper addresses the task of explaining the prediction of a classifier, considering the case where no information is available, neither on the classifier itself, nor on the processed data (neither the training nor the test data). It proposes an inverse classification approach whose principle consists in determining the minimal changes needed to alter a prediction: in an instance-based framework, given a data point whose classification must be explained, the proposed method consists in identifying a close neighbor classified differently, where the closeness definition integrates a sparsity constraint. This principle is implemented using observation generation in the Growing Spheres algorithm. Experimental results on two datasets illustrate the relevance of the proposed approach that can be used to gain knowledge about the classifier.

Year	DOI	Venue
2018	10.1007/978-3-319-91473-2_9	Communications in Computer and Information Science
Keywords	Field	DocType
Post-hoc interpretability,Comparison-based,Inverse classification,Local explanation	Interpretability,Inverse,Closeness,Computer science,Test data,Artificial intelligence,Classifier (linguistics),Machine learning	Conference
Volume	ISSN	Citations
853	1865-0929	2
PageRank	References	Authors
0.37	10	5

Authors (5 rows)

Cited by (2 rows)

References (10 rows)

Name	Order	Citations	PageRank
Thibault Laugel	1	9	2.87
Marie-Jeanne Lesot	2	220	32.41
Christophe Marsala	3	237	34.77
Xavier Renard	4	9	2.54
Marcin Detyniecki	5	330	39.95

1