Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning - Citegraph

Paper Info

Title
Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning

Abstract
Federated learning is an emerging research paradigm enabling collaborative training of machine learning models among different organizations while keeping data private at each institution. Despite recent progress, there remain fundamental challenges such as the lack of convergence and the potential for catastrophic forgetting across real-world heterogeneous devices. In this paper, we demonstrate that self-attention-based architectures (e.g., Transformers) are more robust to distribution shifts and hence improve federated learning over heterogeneous data. Concretely, we conduct the first rigorous empirical investigation of different neural architectures across a range of federated algorithms, real-world benchmarks, and heterogeneous data splits. Our experiments show that simply replacing convolutional networks with Transformers can greatly reduce catastrophic forgetting of previous devices, accelerate convergence, and reach a better global model, especially when dealing with heterogeneous data. We release our code and pretrained models to encourage future exploration in robust architectures as an alternative to current research efforts on the optimization front.

Year	2022
DOI	10.1109/CVPR52688.2022.00982
Venue	IEEE Conference on Computer Vision and Pattern Recognition
Keywords	Privacy and federated learning
DocType	Conference
Volume	2022
Issue	1
Citations	0
PageRank	0.34
References	0
Authors	8

Authors (8 rows)

Name	Order	Citations	PageRank
Liangqiong Qu	1	0	0.68
Yuyin Zhou	2	97	10.94
Paul Pu Liang	3	94	11.96
Yingda Xia	4	0	0.34
Feifei Wang	5	0	0.34
Li Fei-Fei	6	22483	1135.90
Ehsan Adeli Mosabbeb	7	261	39.27
Daniel L. Rubin	8	1645	145.14
