Title
eCOMPASS: evaluative comparison of multiple protein alignments by statistical score
Abstract
Motivation: Detecting subtle biologically relevant patterns in protein sequences often requires the construction of a large and accurate multiple sequence alignment (MSA). Methods for constructing MSAs are usually evaluated using benchmark alignments, which, however, typically contain very few sequences and are therefore inappropriate when dealing with large numbers of proteins. Results: eCOMPASS addresses this problem using a statistical measure of relative alignment quality based on direct coupling analysis (DCA): to maintain protein structural integrity over evolutionary time, substitutions at one residue position typically result in compensating substitutions at other positions. eCOMPASS computes the statistical significance of the congruence between high scoring directly coupled pairs and 3D contacts in corresponding structures, which depends upon properly aligned homologous residues. We illustrate eCOMPASS using both simulated and real MSAs.
Year
DOI
Venue
2021
10.1093/bioinformatics/btab374
BIOINFORMATICS
DocType
Volume
Issue
Journal
37
20
ISSN
Citations 
PageRank 
1367-4803
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Andrew F Neuwald101.35
Bryan D Kolaczkowski200.34
Stephen F Altschul318026.55