Title
Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles.
Abstract
Machinery of pre-mRNA splicing is carried out through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation in eukaryotes, gives rise to multiple mature mRNA isoforms, which encodes proteins with functional diversities. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups including small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation presents that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means to identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.
Year
DOI
Venue
2014
10.1007/s10822-014-9706-6
Journal of computer-aided molecular design
Keywords
DocType
Volume
RNA splicing,Spliceosome,Splicing-related protein,Amino acid pair composition,Support vector machine
Journal
28
Issue
ISSN
Citations 
1
1573-4951
0
PageRank 
References 
Authors
0.34
17
5
Name
Order
Citations
PageRank
Justin Bo-Kai Hsu11086.69
Kai-Yao Huang21157.91
Tzu-Ya Weng3205.40
Chien-Hsun Huang481.49
Tzong-Yi Lee561737.18