Title
ELSSI: parallel SNP-SNP interactions detection by ensemble multi-type detectors
Abstract
With the development of high-throughput genotyping technology, single nucleotide polymorphism (SNP)-SNP interactions (SSIs) detection has become an essential way for understanding disease susceptibility. Various methods have been proposed to detect SSIs. However, given the disease complexity and bias of individual SSI detectors, these single-detector-based methods are generally unscalable for real genome-wide data and with unfavorable results. We propose a novel ensemble learning-based approach (ELSSI) that can significantly reduce the bias of individual detectors and their computational load. ELSSI randomly divides SNPs into different subsets and evaluates them by multi-type detectors in parallel. Particularly, ELSSI introduces a four-stage pipeline (generate, score, switch and filter) to iteratively generate new SNP combination subsets from SNP subsets, score the combination subset by individual detectors, switch high-score combinations to other detectors for re-scoring, then filter out combinations with low scores. This pipeline makes ELSSI able to detect high-order SSIs from large genome-wide datasets. Experimental results on various simulated and real genome-wide datasets show the superior efficacy of ELSSI to state-of-the-art methods in detecting SSIs, especially for high-order ones. ELSSI is applicable with moderate PCs on the Internet and flexible to assemble new detectors. The code of ELSSI is available at https://www.sdu-idea.cn/codes.php?name=ELSSI.
Year
DOI
Venue
2022
10.1093/bib/bbac213
BRIEFINGS IN BIOINFORMATICS
Keywords
DocType
Volume
SNP-SNP interactions, multi-type detectors, ensemble learning, bias, divide and conquer
Journal
23
Issue
ISSN
Citations 
4
1467-5463
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Xin Wang1018.25
Xia Cao200.34
Yuantao Feng300.34
Mao-Zu Guo452653.96
Guoxian Yu523421.81
Jun Wang667.52