Abstract | ||
---|---|---|
In this paper we propose a new method for recognition of prokaryotic promoter regions with startpoints of transcription. The method is based on Sequence Alignment Kernel, a function reflecting the quantitative measure of match between two sequences. This kernel function is further used in Dual SVM, which performs the recognition. Several recognition methods have been trained and tested on positive data set, consisting of 669 sigma(70)-promoter regions with known transcription startpoints of Escherichia coli and two negative data sets of 709 examples each, taken from coding and non-coding regions of the same genome. The results show that our method performs well and achieves 16.5% average error rate on positive & coding negative data and 18.6% average error rate on positive & non-coding negative data. |
Year | DOI | Venue |
---|---|---|
2003 | 10.1093/bioinformatics/btg265 | BIOINFORMATICS |
Keywords | Field | DocType |
kernel function,error rate,escherichia coli,sequence alignment | Sequence alignment,Kernel (linear algebra),Data set,Computer science,Word error rate,Support vector machine,Coding (social sciences),Bioinformatics,Insertion sequence,Kernel (statistics) | Journal |
Volume | Issue | ISSN |
19 | 15.0 | 1367-4803 |
Citations | PageRank | References |
29 | 2.47 | 20 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Leo Gordon | 1 | 30 | 2.83 |
Alexey Ya. Chervonenkis | 2 | 53 | 6.75 |
Alex J. Gammerman | 3 | 62 | 5.97 |
Ilham A. Shahmuradov | 4 | 77 | 6.75 |
Victor V. Solovyev | 5 | 193 | 35.93 |