Abstract | ||
---|---|---|
An increasing number of algorithms for biochemical network inference from experimental data require discrete data as input. For example, dynamic Bayesian network methods and methods that use the framework of finite dynamical systems, such as Boolean networks, all take discrete input. Experimental data, however, are typically continuous and represented by computer floating point numbers. The translation from continuous to discrete data is crucial in preserving the variable dependencies and thus has a significant impact on the performance of the network inference algorithms. We compare the performance of two such inference algorithms under several different discretization algorithms. One of the inference methods uses a dynamic Bayesian network framework; the other uses a time- and state-discrete dynamical system framework. The discretization algorithms are quantile discretization, interval discretization, and a new algorithm introduced in this article, SSD. SSD is especially designed for short time series data and is capable of determining the optimal number of discretization states. The experiments show that both inference methods perform better with SSD than with the other methods. In addition, SSD is demonstrated to preserve the dynamic features of the time series, as well as to be robust to noise in the experimental data. A C++ implementation of SSD is available from the authors at http://polymath.vbi.vt.edu/discretization. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1089/cmb.2008.0023 | Journal of Computational Biology |
Keywords | Field | DocType |
gene networks, genetic algorithms, linear algebra, reverse engineering, time discrete dynamical systems | Time series, Discretization, Computer science, Inference, Floating point, Theoretical computer science, Dynamical systems theory, Artificial intelligence, Machine learning, Genetic algorithm, Discretization of continuous features, Dynamic Bayesian network | Journal |
Volume | Issue | ISSN |
17 | 6 | 1066-5277 |
Citations | PageRank | References |
9 | 0.90 | 18 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Elena S. Dimitrova | 1 | 29 | 5.36 |
M. Paola Vera Licona | 2 | 9 | 0.90 |
John McGee | 3 | 101 | 14.38 |
Reinhard Laubenbacher | 4 | 179 | 19.01 |
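
The two baseline methods named in the abstract, interval and quantile discretization, can be illustrated with a minimal Python sketch. This is not the authors' code and it does not implement SSD, whose details the abstract does not give. The function names `interval_discretize` and `quantile_discretize`, the rank-based handling of ties, and the sample values are assumptions made for illustration only.

```python
def interval_discretize(values, n_states):
    """Equal-width binning: split [min, max] into n_states intervals of equal length."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant series carries no information; map everything to state 0.
        return [0] * len(values)
    width = (hi - lo) / n_states
    # Clamp so that the maximum value falls in the last state.
    return [min(int((v - lo) / width), n_states - 1) for v in values]


def quantile_discretize(values, n_states):
    """Equal-frequency binning: each state receives roughly the same number of points.

    Assumption: states are assigned by rank, so tied values may land in
    different states. Other tie-handling conventions exist.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    states = [0] * n
    for rank, i in enumerate(order):
        states[i] = rank * n_states // n
    return states


# Hypothetical short time series with two large values.
series = [0.1, 0.2, 0.3, 0.4, 5.0, 9.0]
print(interval_discretize(series, 3))
print(quantile_discretize(series, 3))
```

On a skewed series like this one, interval discretization places most points in the lowest state, while quantile discretization spreads them evenly across states regardless of their spacing. Both require the number of states to be chosen in advance, which is the choice the abstract says SSD is able to make on its own.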