Certified Neural Network Watermarks with Randomized Smoothing. - Citegraph

Paper Info

Title
Certified Neural Network Watermarks with Randomized Smoothing.

Abstract
Watermarking is a commonly used strategy to protect creators’ rights to digital images, videos and audio. Recently, watermarking methods have been extended to deep learning models – in principle, the watermark should be preserved when an adversary tries to copy the model. However, in practice, watermarks can often be removed by an intelligent adversary. Several papers have proposed watermarking methods that claim to be empirically resistant to different types of removal attacks, but these new techniques often fail in the face of new or better-tuned adversaries. In this paper, we propose the first certifiable watermarking method. Using the randomized smoothing technique, we show that our watermark is guaranteed to be unremovable unless the model parameters are changed by more than a certain $\ell_2$ threshold. In addition to being certifiable, our watermark is also empirically more robust compared to previous watermarking methods.

Year	Venue	DocType
2022	International Conference on Machine Learning	Conference
ISSN	Citations	PageRank
ICML 2022	0	0.34
References	Authors
0	8

Authors (8 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Arpit Bansal	1	0	0.34
Ping-Yeh Chiang	2	1	2.72
Curry Michael J.	3	0	2.37
Rajiv Jain	4	0	0.34
Curtis Wigington	5	6	3.17
Varun Manjunatha	6	68	6.43
John P. Dickerson	7	256	37.97
Tom Goldstein	8	5	5.84

1