Title
Prediction of liquid-liquid phase separating proteins using machine learning
Abstract
Background: Liquid-liquid phase separation (LLPS) of biomolecules in cells underpins the formation of membraneless organelles, which are condensates of protein, nucleic acid, or both, and play critical roles in cellular function. Dysregulation of LLPS is implicated in a number of diseases. Although the LLPS of biomolecules has been investigated intensively in recent years, knowledge of the prevalence and distribution of phase separation proteins (PSPs) still lags behind. Development of computational methods to predict PSPs is therefore of great importance for a comprehensive understanding of the biological function of LLPS. Results: Based on the PSPs collected in LLPSDB, we developed a sequence-based prediction tool for LLPS proteins (PSPredictor), which is an attempt at general-purpose PSP prediction that does not depend on specific protein types. Our method combines compositional and sequential information during the protein embedding stage and adopts a machine learning algorithm for the final prediction. The proposed method achieves a tenfold cross-validation accuracy of 94.71% and outperforms previously reported PSP prediction tools. For further applications, we built a user-friendly PSPredictor web server (http://www.pkumdl.cn/PSPredictor), which is accessible for prediction of potential PSPs. Conclusions: PSPredictor could identify novel scaffold proteins for stress granules and predict PSP candidates in the human genome for further study. The web server provides valuable information for recognizing potential PSPs.
Year
2022
DOI
10.1186/s12859-022-04599-w
Venue
BMC BIOINFORMATICS
Keywords
Liquid-liquid phase separation (LLPS), Phase separation proteins (PSPs), Machine learning, Predictor
DocType
Journal
Volume
23
Issue
1
ISSN
1471-2105
Citations
0
PageRank
0.34
References
2
Authors
7
Name
Order
Citations
PageRank
Xiaoquan Chu101.01
Tanlin Sun2211.41
Qian Li315.77
Youjun Xu4202.58
Zhuqing Zhang5122.01
Luhua Lai636933.78
Jianfeng Pei7373.99