Uncovering Semantic Bias in Neural Network Models Using a Knowledge Graph - Citegraph

Paper Info

Title
Uncovering Semantic Bias in Neural Network Models Using a Knowledge Graph

Abstract
While neural network models have shown impressive performance in many NLP tasks, lack of interpretability is often seen as a disadvantage. Individual relevance scores assigned by post-hoc explanation methods are not sufficient to show deeper systematic preferences and potential biases of the model that apply consistently across examples. In this paper, we apply rule mining using knowledge graphs in combination with neural network explanation methods to uncover such systematic preferences of trained neural models and capture them in the form of conjunctive rules. We test our approach in the context of text classification tasks and show that such rules are able to explain a substantial part of the model behaviour as well as indicate potential causes of misclassifications when the model is applied outside of the initial training context.

Year	DOI	Venue
2020	10.1145/3340531.3412009	CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, Virtual Event, Ireland, October 2020
DocType	ISBN	Citations
Conference	978-1-4503-6859-9	0
PageRank	References	Authors
0.34	21	2

Authors (2 rows)

Name	Order	Citations	PageRank
Andriy Nikolov	1	769	53.09
Mathieu d'Aquin	2	1227	106.53

Cited by (0 rows)

References (21 rows)
