Galaxy: Towards Scalable and Interpretable Explanation on High-Dimensional and Spatio-Temporal Correlated Climate Data - Citegraph

Paper Info

Title
Galaxy: Towards Scalable and Interpretable Explanation on High-Dimensional and Spatio-Temporal Correlated Climate Data

Abstract
Interpretability has become a major criterion for designing predictive models in climate science. High interpretability can provide qualitative understanding between the meteorological variables and the climate phenomena which is helpful for climate scientists to learn causes of climate events. However, detecting the features which have efficient interpretability to observed events is challenging in spatio-temporal climate data because the key features may be overlooked by the redundancy due to the high degree of spatial and temporal correlations among the features, especially in high dimensionality. Furthermore, climate events occurred in different regions or different times may have different explanatory patterns, detecting explanations for overall climate phenomena is also difficult. Here we propose Galaxy, a new interpretable predictive model. Galaxy allows us to represent the explanatory patterns of subpopulations within an overall population of the target. Each explanatory pattern is defined by the smallest feature subset that the conditional distribution of target actually depends on, which we define as the minimal target explanation. Based on the detection of such explanatory patterns, Galaxy can detect the Galaxy space, the explanations for the overall target population, by sequentially detecting target explanation of every individual subpopulation of the target, and then forecast the target variable by their ensemble predictive power. We validate our approach by comparing Galaxy to several state-of-the-art baselines in a set of comparative experiments and then evaluate how Galaxy can be used to identify the explanatory space and give a referential explanation report in a real-world scenario on a given location in the United States.

Year	DOI	Venue
2018	10.1109/ICBK.2018.00027	2018 IEEE International Conference on Big Knowledge (ICBK)
Keywords	Field	DocType
Interpretable explanation, Long-lead rainfall forecasting, AdaBoost	Population,Interpretability,Conditional probability distribution,Predictive power,Curse of dimensionality,Feature extraction,Redundancy (engineering),Artificial intelligence,Galaxy,Machine learning	Conference
ISBN	Citations	PageRank
978-1-5386-9126-7	0	0.34
References	Authors
10	6

Authors (6 rows)

Cited by (0 rows)

References (10 rows)

Name	Order	Citations	PageRank
Yong Zhuang	1	254	13.88
David L. Small	2	0	0.34
Xin Shu	3	31	6.50
Kui Yu	4	36	10.16
Shafiqul Islam	5	33	7.94
Wei Ding	6	834	72.61

1