Title
Privacy-preserving decision tree for epistasis detection.
Abstract
The interaction between gene loci, namely epistasis, is a widespread biological genetic phenomenon. In genome-wide association studies(GWAS), epistasis detection of complex diseases is a major challenge. Although many approaches using statistics, machine learning, and information entropy were proposed for epistasis detection, the privacy preserving for single nucleotide polymorphism(SNP) data has been largely ignored. Thus, this paper proposes a novel two-stage approach. A fusion strategy assists in combining and sorting the SNPs importance scores obtained by the relief and mutual information, thereby obtaining a candidate set of SNPs. This avoids missing some SNPs with strong interaction. Furthermore, differentially private decision tree is applied to search for SNPs. This achieves the efficient epistasis detection of complex diseases on the basis of privacy preserving compared with heuristic methods. The recognition rate on simulation data set is more than 90%. Also, several susceptible loci including rs380390 and rs1329428 are found in the real data set for Age-related Macular Degeneration (AMD). This demonstrates that our method is promising in epistasis detection.
Year
DOI
Venue
2019
10.1186/s42400-019-0025-z
Cybersecurity
Keywords
DocType
Volume
Epistasis, Relief, Mutual information, Decision tree, Differential privacy
Journal
2
Issue
ISSN
Citations 
1
2523-3246
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
Qingfeng Chen1256.74
X. Zhang219043.25
Ruchang Zhang300.34