Title
DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.
Abstract
Motivation: Proteases are enzymes that cleave target substrate proteins by catalyzing the hydrolysis of peptide bonds between specific amino acids. While the functional proteolysis regulated by proteases plays a central role in 'life and death' cellular processes, many of the corresponding substrates and their cleavage sites have not yet been identified. The availability of accurate predictors of substrates and cleavage sites would facilitate understanding of proteases' functions and physiological roles. Deep learning is a promising approach for the development of accurate predictors of substrate cleavage events.
Results: We propose DeepCleave, the first deep learning-based predictor of protease-specific substrates and cleavage sites. DeepCleave uses protein substrate sequence data as input and employs convolutional neural networks with transfer learning to train accurate predictive models. The high predictive performance of our models stems from the use of high-quality cleavage site features extracted from the substrate sequences through the deep learning process, and from the application of transfer learning, multiple kernels and an attention layer in the design of the deep network. Empirical tests against several related state-of-the-art methods demonstrate that DeepCleave outperforms these methods in predicting caspase and matrix metalloprotease substrate-cleavage sites.
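As a rough illustration of what "protein substrate sequence data as input" can look like for a convolutional network, the sketch below cuts fixed-length windows around candidate cleavage sites and one-hot encodes them. The window length, the padding symbol and the function names are assumptions made for illustration and are not taken from DeepCleave. Restricting candidates to aspartate reflects only the general preference of caspases for Asp at the P1 position.

```python
# Illustrative sketch only: helper names, window size and padding symbol
# are hypothetical and are not taken from the DeepCleave paper.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
PAD = "X"  # padding symbol for windows that run past the sequence ends


def candidate_windows(sequence, p1_residue="D", flank=4):
    """Yield (p1_position, window) for every residue equal to p1_residue.

    p1_position is 1-based. The window holds `flank` residues up to and
    including P1, followed by `flank` residues after the scissile bond
    (P4..P1 | P1'..P4' when flank=4).
    """
    padded = PAD * flank + sequence + PAD * flank
    for i, residue in enumerate(sequence):
        if residue == p1_residue:
            # Residue i of `sequence` sits at index i + flank in `padded`,
            # so the window starts flank - 1 positions before it.
            start = i + 1
            yield i + 1, padded[start:start + 2 * flank]


def one_hot(window):
    """Encode a window as rows of 20 zeros/ones; PAD becomes an all-zero row."""
    return [[1 if aa == residue else 0 for aa in AMINO_ACIDS]
            for residue in window]


if __name__ == "__main__":
    for position, window in candidate_windows("MDEVDGAK"):
        matrix = one_hot(window)
        print(position, window, len(matrix), "x", len(matrix[0]))
```

Each encoded window is a small matrix (window length by 20) of the kind a one-dimensional convolution can scan. A trained model would then assign each candidate site a cleavage score.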
Year: 2020
DOI: 10.1093/bioinformatics/btz721
Venue: BIOINFORMATICS
Field: Matrix Metalloprotease, Computer science, Cell biology, Bioinformatics, Caspase, Cleavage (embryo)
DocType: Journal
Volume: 36
Issue: 4
ISSN: 1367-4803
Citations: 2
PageRank: 0.35
References: 0
Authors: 12
Name, Order, Citations, PageRank
Fuyi Li, 1, 97, 11.25
Jinxiang Chen, 2, 8, 1.81
André Leier, 3, 3, 0.70
Tatiana T. Marquez-Lago, 4, 77, 9.01
Quanzhong Liu, 5, 7, 1.11
Yanze Wang, 6, 2, 0.35
Jerico Revote, 7, 5, 1.47
A. Ian Smith, 8, 32, 2.88
Tatsuya Akutsu, 9, 2169, 216.05
Geoffrey I. Webb, 10, 99, 12.05
Lukasz Kurgan, 11, 7, 2.47
Jiangning Song, 12, 374, 41.93