Title
A secure SNP panel scheme using homomorphically encrypted K-mers without SNP calling on the user side.
Abstract
Single Nucleotide Polymorphism (SNP) in the genome has become crucial information for clinical use. For example, the targeted cancer therapy is primarily based on the information which clinically important SNPs are detectable from the tumor. Many hospitals have developed their own panels that include clinically important SNPs. The genome information exchange between the patient and the hospital has become more popular. However, the genome sequence information is innate and irreversible and thus its leakage has serious consequences. Therefore, protecting one’s genome information is critical. On the other side, hospitals may need to protect their own panels. There is no known secure SNP panel scheme to protect both. In this paper, we propose a secure SNP panel scheme using homomorphically encrypted K-mers without requiring SNP calling on the user side and without revealing the panel information to the user. Use of the powerful homomorphic encryption technique is desirable, but there is no known algorithm to efficiently align two homomorphically encrypted sequences. Thus, we designed and implemented a novel secure SNP panel scheme utilizing the computationally feasible equality test on two homomorphically encrypted K-mers. To make the scheme work correctly, in addition to SNPs in the panel, sequence variations at the population level should be addressed. We designed a concept of Point Deviation Tolerance (PDT) level to address the false positives and false negatives. Using the TCGA BRCA dataset, we demonstrated that our scheme works at the level of over a hundred thousand somatic mutations. In addition, we provide a computational guideline for the panel design, including the size of K-mer and the number of SNPs. The proposed method is the first of its kind to protect both the user’s sequence and the hospital’s panel information using the powerful homomorphic encryption scheme. We demonstrated that the scheme works with a simulated dataset and the TCGA BRCA dataset. In this study, we have shown only the feasibility of the proposed scheme and much more efforts should be done to make the scheme usable for clinical use.
Year
DOI
Venue
2019
10.1186/s12864-019-5473-z
BMC Genomics
Keywords
DocType
Volume
SNP panel, Homomorphic encryption, K-mer, Genomic security, Genomic privacy
Journal
2
Issue
ISSN
Citations 
suppl
1471-2164
0
PageRank 
References 
Authors
0.34
0
8
Name
Order
Citations
PageRank
Sungjoon Park100.34
Minsu Kim200.34
Seokjun Seo300.34
Seungwan Hong4124.70
Kyoohyung Han5577.43
Keewoo Lee6163.02
Jung Hee Cheon71787129.74
Sun Kim8185.62