Title
Smi-Blast: A Novel Supervised Search Framework Based On Psi-Blast For Protein Remote Homology Detection
Abstract
Motivation: As one of the most important and widely used mainstream iterative search tool for protein sequence search, an accurate Position-Specific Scoring Matrix (PSSM) is the key of PSI-BLAST. However, PSSMs containing non-homologous information obviously reduce the performance of PSI-BLAST for protein remote homology.Results: To further study this problem, we summarize three types of Incorrectly Selected Homology (ISH) errors in PSSMs. A new search tool Supervised-Manner-based Iterative BLAST (SMI-BLAST) is proposed based on PSIBLAST for solving these errors. SMI-BLAST obviously outperforms PSI-BLAST on the Structural Classification of Proteins-extended (SCOPe) dataset. Compared with PSI-BLAST on the ISH error subsets of SCOPe dataset, SMIBLAST detects 1.6-2.87 folds more remote homologous sequences, and outperforms PSI-BLAST by 35.66% in terms of ROC1 scores. Furthermore, this framework is applied to JackHMMER, DELTA-BLAST and PSI-BLASTexB, and their performance is further improved.
Year
DOI
Venue
2021
10.1093/bioinformatics/btaa772
BIOINFORMATICS
DocType
Volume
Issue
Journal
37
7
ISSN
Citations 
PageRank 
1367-4803
1
0.35
References 
Authors
0
5
Name
Order
Citations
PageRank
Xiaopeng Jin111.36
Qing Liao23711.60
Hang Wei3102.20
Jun Zhang410.69
Bin Liu541933.30