Abstract | ||
---|---|---|
Given a multiple alignment of orthologous DNA sequences and a phylogenetic tree for these sequences, we investigate the problem of reconstructing the most likely scenario of insertions and deletions capable of explaining the gaps observed in the alignment. This problem, that we called the Indel Maximum Likelihood Problem ( IMLP), is an important step toward the reconstruction of ancestral genomics sequences, and is important for studying evolutionary processes, genome function, adaptation and convergence. We solve the IMLP using a new type of tree hidden Markov model whose states correspond to single-base evolutionary scenarios and where transitions model dependencies between neighboring columns. The standard Viterbi and Forward-backward algorithms are optimized to produce the most likely ancestral reconstruction and to compute the level of confidence associated to specific regions of the reconstruction. A heuristic is presented to make the method practical for large data sets, while retaining an extremely high degree of accuracy. The methods are illustrated on a 1-Mb alignment of the CFTR regions from 12 mammals. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1089/cmb.2007.A006 | JOURNAL OF COMPUTATIONAL BIOLOGY |
Keywords | Field | DocType |
ancestral genome reconstruction,ancestral mammalian genomes,Indel Maximum Likelihood Problem,insertions and deletions,tree-HMM | Heuristic,Data set,Phylogenetic tree,Ancestral reconstruction,Algorithm,Bioinformatics,Hidden Markov model,Multiple sequence alignment,Viterbi algorithm,Mathematics,Indel | Journal |
Volume | Issue | ISSN |
14.0 | 4 | 1066-5277 |
Citations | PageRank | References |
11 | 0.97 | 8 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Abdoulaye Baniré Diallo | 1 | 40 | 9.37 |
V Makarenkov | 2 | 266 | 32.98 |
Mathieu Blanchette | 3 | 631 | 62.65 |