Encrypted Value Iteration and Temporal Difference Learning over Leveled Homomorphic Encryption - Citegraph

Paper Info

Title
Encrypted Value Iteration and Temporal Difference Learning over Leveled Homomorphic Encryption

Abstract
We consider an architecture of confidential cloud-based control synthesis based on Homomorphic Encryption (HE). Our study is motivated by the recent surge of data-driven control such as deep reinforcement learning, whose heavy computational requirements often necessitate an outsourcing to the third party server. To achieve more flexibility than Partially Homomorphic Encryption (PHE) and less computational overhead than Fully Homomorphic Encryption (FHE), we consider a Reinforcement Learning (RL) architecture over Leveled Homomorphic Encryption (LHE). We first show that the impact of the encryption noise under the Cheon-Kim-Kim-Song (CKKS) encryption scheme on the convergence of the model-based tabular Value Iteration (VI) can be analytically bounded. We also consider secure implementations of TD(0), SARSA(0) and Z-learning algorithms over the CKKS scheme, where we numerically demonstrate that the effects of the encryption noise on these algorithms are also minimal.

Year	DOI	Venue
2021	10.23919/ACC50511.2021.9483184	2021 AMERICAN CONTROL CONFERENCE (ACC)
DocType	ISSN	Citations
Conference	0743-1619	0
PageRank	References	Authors
0.34	0	2

Authors (2 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Jihoon Suh	1	0	0.34
Takashi Tanaka	2	34	12.22

1