Title
The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems
Abstract
This article presents and characterizes an Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning (FL) have focused mainly on synthetic datasets and use a limited number of applications. OARF mimics more realistic application scenarios with publicly available datasets as different data silos in image, text, and structured data. Our characterization shows that the benchmark suite is diverse in data size, distribution, feature distribution, and learning task complexity. The extensive evaluations with reference implementations show the future research opportunities for important aspects of FL systems. We have developed reference implementations, and evaluated the important aspects of FL, including model accuracy, communication cost, throughput, and convergence time. Through these evaluations, we discovered some interesting findings such as FL can effectively increase end-to-end throughput. The code of OARF is publicly available on GitHub.(1)
Year
DOI
Venue
2022
10.1145/3510540
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY
Keywords
DocType
Volume
Federated learning, machine learning, benchmark, dataset, framework
Journal
13
Issue
ISSN
Citations 
4
2157-6904
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Hu Sixu100.34
Li Yuan200.34
X.L. Liu31111.83
Qinbin Li494.04
Zhaomin Wu500.68
Bingsheng He62810179.09