Title
Compact Genetic Algorithm-Based Feature Selection for Sequence-Based Prediction of Dengue–Human Protein Interactions
Abstract
Dengue Virus (DENV) infection is one of the rapidly spreading mosquito-borne viral infections in humans. Every year, around 50 million people get affected by DENV infection, resulting in 20,000 deaths. Despite the recent experiments focusing on dengue infection to understand its functionality in the human body, several functionally important DENV-human protein-protein interactions (PPIs) have remained unrecognized. This article presents a model for predicting new DENV-human PPIs by combining different sequence-based features of human and dengue proteins like the amino acid composition, dipeptide composition, conjoint triad, pseudo amino acid composition, and pairwise sequence similarity between dengue and human proteins. A Learning vector quantization (LVQ)-based Compact Genetic Algorithm (CGA) model is proposed for feature subset selection. CGA is a probabilistic technique that simulates the behavior of a Genetic Algorithm (GA) with lesser memory and time requirements. Prediction of DENV-human PPIs is performed by the weighted Random Forest (RF) technique as it is found to perform better than other classifiers. We have predicted 1013 PPIs between 335 human proteins and 10 dengue proteins. All predicted interactions are validated by literature filtering, GO-based assessment, and KEGG Pathway enrichment analysis. This study will encourage the identification of potential targets for more effective anti-dengue drug discovery.
Year
DOI
Venue
2022
10.1109/TCBB.2021.3066597
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Keywords
DocType
Volume
Algorithms,Amino Acids,Animals,Dengue Virus,Humans,Proteins
Journal
19
Issue
ISSN
Citations 
4
1545-5963
0
PageRank 
References 
Authors
0.34
19
2
Name
Order
Citations
PageRank
Lopamudra Dey100.34
Anirban Mukhopadhyay271150.07