Title
Privacy-preserving Model Training for Disease Prediction Using Federated Learning with Differential Privacy.
Abstract
Machine learning is playing an increasingly critical role in health science with its capability of inferring valuable information from high-dimensional data. More training data provides greater statistical power to generate better models that can help decision-making in healthcare. However, this often requires combining research and patient data across institutions and hospitals, which is not always possible due to privacy considerations. In this paper, we outline a simple federated learning algorithm implementing differential privacy to ensure privacy when training a machine learning model on data spread across different institutions. We tested our model by predicting breast cancer status from gene expression data. Our model achieves a similar level of accuracy and precision as a single-site non-private neural network model when we enforce privacy. This result suggests that our algorithm is an effective method of implementing differential privacy with federated learning, and clinical data scientists can use our general framework to produce differentially private models on federated datasets. Our framework is available at https://github.com/gersteinlab/idash20FL.
Year
DOI
Venue
2022
10.1109/EMBC48229.2022.9871742
Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)
DocType
Volume
ISSN
Conference
2022
2694-0604
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Amol Khanna100.34
Vincent Schaffer200.34
Gamze Gursoy300.34
Mark Gerstein435445.41