Title
Subsampled Information Criteria for Bayesian Model Selection in the Big Data Setting
Abstract
Bayesian methods face unprecedented challenges in the era of big data, as the evaluation of likelihood in each iteration is computationally intensive. To deal with this bottleneck, recent literature focus mostly on speeding up Markov chain Monte Carlo (MCMC). Model selection, which is an important topic, has not received much attention. In the Bayesian context, deviance-based criteria, such as the deviance information criterion (DIC), are well-known for model selection purposes. In this article, we introduce the subsampled DIC and the subsampled information criterion IC in the big data context. Extensive simulation studies are conducted to evaluate the empirical performance of the proposed criterion. The usage of our proposed criterion is further illustrated with an analysis of the Covertype dataset.
Year
DOI
Venue
2019
10.1109/BigData47090.2019.9006275
2019 IEEE International Conference on Big Data (Big Data)
Keywords
Field
DocType
DIC,IC,MCMC,Nonuniform Subsample
Deviance information criterion,Data mining,Bottleneck,Bayesian inference,Markov chain Monte Carlo,Information Criteria,Computer science,Model selection,Artificial intelligence,Big data,Machine learning,Bayesian probability
Conference
ISSN
ISBN
Citations 
2639-1589
978-1-7281-0859-9
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
Lijiang Geng100.34
Yishu Xue200.34
Guanyu Hu300.68