Title
Genome phylogeny based on short-range correlations in DNA sequences.
Abstract
The surprising fact that global statistical properties computed on a genomewide scale may reveal species information has first been observed in studies of dinucleotide frequencies. Here we will look at the same phenomenon with a totally different statistical approach. We show that patterns in the short-range statistical correlations in DNA sequences serve as evolutionary fingerprints of eukaryotes. All chromosomes of a species display the same characteristic pattern, markedly different from those of other species. The chromosomes of a species are sorted onto the same branch of a phylogenetic tree due to this correlation pattern. The average correlation between nucleotides at a distance k is quantified in two independent ways: (i) by estimating it from a higher-order Markov process and (ii) by computing the mutual information function at a distance k. We show how the quality of phylogenetic reconstruction depends on the range of correlation strengths and on the length of the underlying sequence segment. This concept of the correlation pattern as a phylogenetic signature of eukaryote species combines two rather distant domains of research, namely phylogenetic analysis based on molecular observation and the study of the correlation structure of DNA sequences.
Year
DOI
Venue
2005
10.1089/cmb.2005.12.545
JOURNAL OF COMPUTATIONAL BIOLOGY
Keywords
Field
DocType
information theory,eukaryote genomes,species distinction
Genome,Phylogenetic tree,Chromosome,Biology,Mutual information,Computational phylogenetics,DNA sequencing,Bioinformatics,Phylogenetics,Phylogenetic network
Journal
Volume
Issue
ISSN
12.0
5
1066-5277
Citations 
PageRank 
References 
4
1.01
2
Authors
4
Name
Order
Citations
PageRank
Manuel Dehnert141.68
Rainer Plaumann241.01
Werner E Helm342.02
Marc-Th Hütt441.01