Title
A bank of protein family patterns for rapid identification of possible functions of amino acid sequences.
Abstract
A method and software tool to develop patterns of protein families has been designed. These patterns are intended for the identification of local similarities in arbitrary amino acid sequences with proteins of the SWISS-PROT bank. The method is based on the physical, chemical and structural properties of amino acids. it assembles a 'best set' of elements (a pattern) for a given group of aligned related proteins. These elements provide discrimination between proteins of a family and representatives of other families or random sequences. The method combines the advantages of BLOCKS (automatic generation of multiple elements for protein groups), PROSITE (simplicity of element presentation) and matrices/profiles (different distinctions between amino acids for different positions of aligned sequences). Using our method a data bank of protein family patterns, PROF_PAT, is produced. This data bank is based on the 27 752 amino acid sequences of SWISS-PROT bank release 24. The characteristics of patterns of 743 related protein groups are described. The results of comparisons of PROF_PAT patterns with the proteins of the SWISS-PROT bank ale discussed.
Year
DOI
Venue
1997
10.1093/bioinformatics/13.2.115
COMPUTER APPLICATIONS IN THE BIOSCIENCES
Keywords
Field
DocType
random sequence,amino acid sequence,protein family,amino acid
Software tool,Data bank,Protein family,Amino acid,Computer science,Homology (biology),Bioinformatics,PROSITE
Journal
Volume
Issue
ISSN
13
2
0266-7061
Citations 
PageRank 
References 
3
1.09
5
Authors
5