Title
Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances
Abstract
Provenance and intervention-based techniques have been used to explain surprisingly high or low outcomes of aggregation queries. However, such techniques may miss interesting explanations emerging from data that is not in the provenance. For instance, an unusually low number of publications of a prolific researcher in a certain venue and year can be explained by an increased number of publications in another venue in the same year. We present a novel approach for explaining outliers in aggregation queries through counter- balancing. That is, explanations are outliers in the opposite direction of the outlier of interest. Outliers are defined w.r.t. patterns that hold over the data in aggregate. We present efficient methods for mining such aggregate regression pat- terns (ARPs), discuss how to use ARPs to generate and rank explanations, and experimentally demonstrate the efficiency and effectiveness of our approach.
Year
DOI
Venue
2019
10.1145/3299869.3300066
Proceedings of the 2019 International Conference on Management of Data
Keywords
Field
DocType
aggregate queries, data analysis, explanations, pattern mining, provenance, regression
Information retrieval,Computer science,Provenance,Database
Conference
Volume
ISSN
ISBN
2019
0730-8078
978-1-4503-5643-5
Citations 
PageRank 
References 
1
0.36
0
Authors
4
Name
Order
Citations
PageRank
Zhengjie Miao1116.61
Qitian Zeng212.05
Boris Glavic328436.70
Sudeepa Roy426830.95