Abstract | ||
---|---|---|
Mining high dimensional data-sets extracted from real world problems is a challenging task due to the large features' space. The latent variables are used to reduce the dimensions of this space by representing highly dependent features. They simplify the creation of probabilistic models and they clarify the semantic of the inferred knowledge. Learning these variables for Bayesian network, as the most generic probabilistic model, is problematic. Actually, there is not a direct way that leads to finding their cardinalities. The precision of the inferred model is highly dependent on the accuracy of the latent variable's cardinality. Therefore, choosing a small value leads to a generalized model having a high rate of information loss. Moreover, a high cardinality tend to over-fit the data, to generate complex latent variables and to burden the parameter learning of the probabilistic model. In this paper, we propose a new criterion based on the mutual information and the log likelihood, called the equilibrium criterion. We mathematically and experimentally validate its efficiency for estimating the cardinality of the latent variable. We also demonstrate its performance in finding the hidden cause of a set of observed variables. The experimental analysis shows that our method succeeded in restoring the original cardinality of intentionally deleted variables in known networks. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/ICTAI.2015.138 | IEEE International Conference on Tools with Artificial Intelligence |
Keywords | Field | DocType |
Latent variable, Bayesian network, mutual information, log likelihood | Pattern recognition,Computer science,Cardinality,Latent class model,Latent variable,Bayesian network,Statistical model,Artificial intelligence,Probabilistic latent semantic analysis,Mutual information,Probabilistic logic,Machine learning | Conference |
ISSN | Citations | PageRank |
1082-3409 | 1 | 0.35 |
References | Authors | |
15 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hasna Njah | 1 | 12 | 1.88 |
Salma Jamoussi | 2 | 50 | 19.98 |
Walid Mahdi | 3 | 116 | 25.49 |
Afif Masmoudi | 4 | 50 | 10.25 |