Abstract | ||
---|---|---|
Bayesian methods face unprecedented challenges in the era of big data, as the evaluation of likelihood in each iteration is computationally intensive. To deal with this bottleneck, recent literature focus mostly on speeding up Markov chain Monte Carlo (MCMC). Model selection, which is an important topic, has not received much attention. In the Bayesian context, deviance-based criteria, such as the deviance information criterion (DIC), are well-known for model selection purposes. In this article, we introduce the subsampled DIC and the subsampled information criterion IC in the big data context. Extensive simulation studies are conducted to evaluate the empirical performance of the proposed criterion. The usage of our proposed criterion is further illustrated with an analysis of the Covertype dataset. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/BigData47090.2019.9006275 | 2019 IEEE International Conference on Big Data (Big Data) |
Keywords | Field | DocType |
DIC,IC,MCMC,Nonuniform Subsample | Deviance information criterion,Data mining,Bottleneck,Bayesian inference,Markov chain Monte Carlo,Information Criteria,Computer science,Model selection,Artificial intelligence,Big data,Machine learning,Bayesian probability | Conference |
ISSN | ISBN | Citations |
2639-1589 | 978-1-7281-0859-9 | 0 |
PageRank | References | Authors |
0.34 | 0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Lijiang Geng | 1 | 0 | 0.34 |
Yishu Xue | 2 | 0 | 0.34 |
Guanyu Hu | 3 | 0 | 0.68 |