Title
Context dependent substitution biases vary within the human genome.
Abstract
BACKGROUND: Models of sequence evolution typically assume that different nucleotide positions evolve independently. This assumption is widely appreciated to be an over-simplification. The best known violations involve biases due to adjacent nucleotides. There have also been suggestions that biases exist at larger scales, however this possibility has not been systematically explored. RESULTS: To address this we have developed a method which identifies over- and under-represented substitution patterns and assesses their overall impact on the evolution of genome composition. Our method is designed to account for biases at smaller pattern sizes, removing their effects. We used this method to investigate context bias in the human lineage after the divergence from chimpanzee. We examined bias effects in substitution patterns between 2 and 5 bp long and found significant effects at all sizes. This included some individual three and four base pair patterns with relatively large biases. We also found that bias effects vary across the genome, differing between transposons and non-transposons, between different classes of transposons, and also near and far from genes. CONCLUSIONS: We found that nucleotides beyond the immediately adjacent one are responsible for substantial context effects, and that these biases vary across the genome.
Year
DOI
Venue
2010
10.1186/1471-2105-11-462
BMC Bioinformatics
Keywords
DocType
Volume
human genome,cpg,computational biology,context dependent,base pair,bioinformatics,algorithms,patterns,methylation,nucleotides,oligonucleotides,maximum likelihood,microarrays,context effect
Journal
11
Issue
ISSN
Citations 
1
1471-2105
11
PageRank 
References 
Authors
0.47
8
5
Name
Order
Citations
PageRank
P. Andrew Nevarez1110.47
Christopher M. DeBoever2110.47
Benjamin J. Freeland3110.47
Marissa A. Quitt4110.47
Eliot C. Bush5152.06