Title
CORE: Common Region Extension Based Multiple Protein Structure Alignment for Producing Multiple Solution
Abstract
Over the past several decades, biologists have conducted numerous studies examining both the general and the specific functions of proteins. Generally, if two proteins are similar in either structure or amino acid sequence, they are expected to share a biological function. Protein function is determined primarily by structure rather than by amino acid sequence, so algorithms for protein structure alignment are an essential tool for this research. The quality of such an algorithm depends on the quality of the similarity measure it uses, which serves as the objective function for determining the best alignment. However, none of the existing similarity measures has become a gold standard, because each has its own strengths and weaknesses, and they require excessive filtering to find a single alignment. In this paper, we introduce a new strategy that finds not a single alignment but multiple alignments of different lengths. This strategy has the obvious benefit of producing high-quality alignments, but it introduces a new problem: its running time is considerably longer than that of methods that find only a single alignment. To address this problem, we propose algorithms that locate a common region (CORE) of multiple alignment candidates and then extend the CORE into multiple alignments. Because the CORE can be defined from a final alignment, we introduce CORE*, which is similar to the CORE, and propose an algorithm to identify it. By adopting CORE* and dynamic programming, our proposed method produces multiple alignments of various lengths with higher accuracy than previous methods. In the experiments, the alignments identified by our algorithm are longer than those obtained by TM-align by 17% and 15.48%, on average, when the comparison is conducted at the super-family and fold levels, respectively.
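The abstract does not give a formal definition of the CORE or of CORE*. As a rough illustration only, the sketch below assumes that each candidate alignment is a list of (i, j) residue-index pairs and takes the common region to be the pairs shared by every candidate. The function name `common_region` and the sample data are hypothetical and are not taken from the paper.

```python
from functools import reduce


def common_region(alignments):
    """Return the residue pairs shared by every candidate alignment, sorted.

    Assumption: an alignment is a list of (i, j) index pairs, meaning that
    residue i of one structure is matched with residue j of the other.
    """
    if not alignments:
        return []
    shared = reduce(set.intersection, (set(a) for a in alignments))
    return sorted(shared)


# Three hypothetical candidate alignments of different lengths.
candidates = [
    [(0, 1), (1, 2), (2, 3), (3, 4)],
    [(1, 2), (2, 3), (3, 4), (4, 6)],
    [(0, 0), (1, 2), (2, 3), (3, 4), (5, 7)],
]

core = common_region(candidates)
print(core)
```

Under this assumption, the shared pairs could act as a seed that each longer alignment extends in its own way, which is the intuition behind extending one common region into several solutions.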
Year: 2013
DOI: 10.1007/s11390-013-1365-x
Venue: J. Comput. Sci. Technol.
Keywords: structure alignment, protein structure, similarity search
Field: Dynamic programming, Structural alignment, Alignment-free sequence analysis, Similarity measure, Computer science, Algorithm, Filter (signal processing), Bioinformatics, Multiple sequence alignment, Nearest neighbor search, Distributed computing, Protein structure
DocType: Journal
Volume: 28
Issue: 4
ISSN: 1860-4749
Citations: 0
PageRank: 0.34
References: 14
Authors: 3
Name             Order  Citations  PageRank
Woo-Cheol Kim    1      37         5.46
Sanghyun Park    2      729        80.64
Jung-Im Won      3      86         10.56