Title
Granular Multiple Kernel Learning For Identifying Rna-Binding Protein Residues Via Integrating Sequence And Structure Information
Abstract
RNA-binding proteins play an important role in the biological process. However, the traditional experiment technology to predict RNA-binding residues is time-consuming and expensive, so the development of an effective computational approach can provide a strategy to solve this issue. In recent years, most of the computational approaches are constructed on protein sequence information, but the protein structure has not been considered. In this paper, we use a novel computational model of RNA-binding residues prediction, using protein sequence and structure information. Our hybrid features are encoded by local sequence and structure feature extraction models. Our predictor is built by employing the Granular Multiple Kernel Support Vector Machine with Repetitive Under-sampling (GMKSVM-RU). In order to evaluate our method, we use fivefold cross-validation on the RBP129, our method achieves better experimental performance with MCC of 0.3367 and accuracy of 88.84%. In order to further evaluate our model, an independent data set (RBP60) is employed, and our method achieves MCC of 0.3921 and accuracy of 87.52%. Above results demonstrate that integrating sequence and structure information is beneficial to improve the prediction ability of RNA-binding residues.
Year
DOI
Venue
2021
10.1007/s00521-020-05573-4
NEURAL COMPUTING & APPLICATIONS
Keywords
DocType
Volume
RNA-binding residues, Granular multiple kernel support vector machine, Hexagon, Structure information
Journal
33
Issue
ISSN
Citations 
17
0941-0643
1
PageRank 
References 
Authors
0.35
0
5
Name
Order
Citations
PageRank
Chao Yang18722.49
yihong ding24010.39
Qiaozhen Meng310.35
Jijun Tang437048.23
Fei Guo5165.31