Title
A Physicochemical Model for Analyzing DNA Sequences.
Abstract
In search of an ab initio model to characterize DNA sequences as genes and nonorenes, we examined some physicochemical properties of each trinucleotide (codon), which could accomplish this task. We constructed three-dimensional vectors for each double-helical trinucleotide sequence considering hydrogen-bonding energy, stacking energy, and a third parameter, which we provisionally identified with DNA-protein interactions. As this three-dimensional vector moves along any genome, the net orientation of the resultant vector should differ significantly for gene and nongene regions to make a distinction feasible, if the underlying model has some merits. An analysis of 331 prokaryotic genomes comprising a total of 294 786 experimentally verified genes (nonoverlapping) and an equal number of nongenes presents a proof of concept of the model without the need for further parametrization. Also, initial analyses on Saccharonzyces cerevisiae and Arabidopsis thaliana suggest that the methodology is extendable to eukaryotes. The physicochemical model (CheinGenome1.0) introduced has the potential to be developed into a gene-finding algorithm and, more pressingly, could be employed for an independent assessment of the annotation of DNA sequences.
Year
DOI
Venue
2006
10.1021/ci050119x
JOURNAL OF CHEMICAL INFORMATION AND MODELING
Keywords
DocType
Volume
dna sequence
Journal
46
Issue
ISSN
Citations 
1
1549-9596
2
PageRank 
References 
Authors
0.44
17
7
Name
Order
Citations
PageRank
Samrat Dutta1102.62
Poonam Singhal220.78
Praveen Agrawal321.12
Raju Tomer420.44
Kritee Kritee520.44
Ekta Khurana6202.20
B Jayaram7417.36