Title
Scalable pseudo-likelihood estimation in hybrid random fields
Abstract
Learning probabilistic graphical models from high-dimensional datasets is a computationally challenging task. In many interesting applications, the domain dimensionality is such as to prevent state-of-the-art statistical learning techniques from delivering accurate models in reasonable time. This paper presents a hybrid random field model for pseudo-likelihood estimation in high-dimensional domains. A theoretical analysis proves that the class of pseudo-likelihood distributions representable by hybrid random fields strictly includes the class of joint probability distributions representable by Bayesian networks. In order to learn hybrid random fields from data, we develop the Markov Blanket Merging algorithm. Theoretical and experimental evidence shows that Markov Blanket Merging scales up very well to high-dimensional datasets. As compared to other widely used statistical learning techniques, Markov Blanket Merging delivers accurate results in a number of link prediction tasks, while achieving also significant improvements in terms of computational efficiency. Our software implementation of the models investigated in this paper is publicly available at http://www.dii.unisi.it/~freno/. The same website also hosts the datasets used in this work that are not available elsewhere in the same preprocessing used for our experiments.
Year
DOI
Venue
2009
10.1145/1557019.1557059
KDD
Keywords
Field
DocType
accurate model,markov blanket merging,statistical learning technique,markov blanket merging scale,state-of-the-art statistical learning technique,hybrid random field model,high-dimensional datasets,hybrid random field,markov blanket merging algorithm,scalable pseudo-likelihood estimation,high-dimensional domain,modularity,bayesian network,random field,bayesian networks,scalability,probability distribution
Data mining,Random field,Joint probability distribution,Computer science,Curse of dimensionality,Preprocessor,Bayesian network,Markov blanket,Artificial intelligence,Graphical model,Machine learning,Scalability
Conference
Citations 
PageRank 
References 
9
0.73
18
Authors
3
Name
Order
Citations
PageRank
Antonino Freno1435.94
Edmondo Trentin228629.25
Marco Gori370534.54