Title
Improving promoter prediction Improving promoter prediction for the NNPP2.2 algorithm: a case study using Escherichia coli DNA sequences
Abstract
Motivation: Although a great deal of research has been undertaken in the area of promoter prediction, prediction techniques are still not fully developed. Many algorithms tend to exhibit poor specificity, generating many false positives, or poor sensitivity. The neural network prediction program NNPP2.2 is one such example. Results: To improve the NNPP2.2 prediction technique, the distance between the transcription start site (TSS) associated with the promoter and the translation start site (TLS) of the subsequent gene coding region has been studied for Escherichia coli K12 bacteria. An empirical probability distribution that is consistent for all E.coli promoters has been established. This information is combined with the results from NNPP2.2 to create a new technique called TLS--NNPP, which improves the specificity of promoter prediction. The technique is shown to be effective using E.coli DNA sequences, however, it is applicable to any organism for which a set of promoters has been experimentally defined. Availability: The data used in this project and the prediction results for the tested sequences can be obtained from http://www.uow.edu.au/~yanxia/E_Coli_paper/SBurden_Results.xls Contact: alh98@uow.edu.au
Year
DOI
Venue
2005
10.1093/bioinformatics/bti047
Bioinformatics
Keywords
Field
DocType
poor specificity,case study,e.coli promoter,improving promoter prediction improving,prediction technique,poor sensitivity,escherichia coli dna sequence,prediction result,e.coli dna sequence,promoter prediction,transcription start site,neural network prediction program,new technique,dna sequence,heterodox economics,climate change,escherichia coli,ecological economics
Sequence alignment,Promoter,Gene,Transcription (biology),Nucleic acid sequence,Computer science,Coding region,Algorithm,DNA sequencing,Bioinformatics,False positive paradox
Journal
Volume
Issue
ISSN
21
5
1367-4803
Citations 
PageRank 
References 
15
1.15
13
Authors
3
Name
Order
Citations
PageRank
S. Burden1151.15
Y.-X. Lin2151.15
Ren Zhang3182.75