Title
A polynomial-time algorithm for a class of protein threading problems.
Abstract
This paper presents an algorithm for constructing an optimal alignment between a three-dimensional protein structure template and an amino acid sequence. A protein structure template is given as a sequence of amino acid residue positions in three-dimensional space, along with an array of physical properties attached to each position; these residue positions are sequentially grouped into a series of core secondary structures (central helices and beta sheets). In addition to match scores and gap penalties, as in a traditional sequence-sequence alignment problem, the quality of a structure-sequence alignment is also detemined by interaction preferences among amino acids aligned with structure positions that are spatially close (we call these 'long-range interactions'). Although it is known that constructing such a structure-sequence alignment in the most general form is NP-hard, our algorithm runs in polynomial time when restricted to structures with a 'modest' number of long-range amino acid interactions. In the current work, long-range interactions are limited to interactions between amino acids from different cove secondary structures. Dividing the series of core secondary structures into two subseries creates a cut set of long-range interactions. If we use N, M and C to represent the size of an amino acid sequence, the size of a structure template, and the maximum cut size of long-range interactions respectively, the algorithm finds an optimal structure-sequence alignment in O(21(C)NM) time, a polynomial function of N and M when C = O(log(N + M)). When running on structure-sequence alignment problems without long-range intersections, i.e. C = O, the algorithm achieves the same asymptotic computational complexity of the Smith-Waterman sequence-sequence alignment algorithm.
Year
DOI
Venue
1996
10.1093/bioinformatics/12.6.511
COMPUTER APPLICATIONS IN THE BIOSCIENCES
Keywords
Field
DocType
computational complexity,protein threading,amino acid,amino acid sequence,polynomial time,physical properties,protein structure,secondary structure,sequence alignment
Sequence alignment,Cut,Structural alignment,Combinatorics,Polynomial,Threading (protein sequence),Algorithm,Bioinformatics,Time complexity,Mathematics,Maximum cut,Protein structure
Journal
Volume
Issue
ISSN
12
6
0266-7061
Citations 
PageRank 
References 
6
3.17
0
Authors
2
Name
Order
Citations
PageRank
Y Xu19346.16
Edward C. Uberbacher222186.43