Title
Dynamic Workload Allocation for Edge Computing
Abstract
Artificial intelligence models implemented in power-efficient Internet-of-Things (IoT) devices have accuracy degradation due to limited power consumption. To mitigate the accuracy loss on IoT devices, an edge-server joint inference system is introduced. On the edge-server inference system, allocate more workloads to the server end can mitigate accuracy loss, but data transmission contributes to the power consumption of the edge device. Thus, in this article, we present a novel two-stage method to allocate workloads to the server or the edge to maximize inference accuracy under a power constraint. In the first stage, we present a clusterwise threshold-based method for estimating the trustworthiness of a prediction made at the edge. In the second stage, we further determine the workload allocation of a trustworthy image based on the probability of the top 1 prediction and the power constraint. In addition, we propose a fine-tuning process to the pretrained model at the edge for achieving better accuracy. In the experiments, we apply the proposed method to several well-known deep neural network models. The results show that the proposed method can improve inference accuracy up to 3.93% under a specific power constraint compared to previous methods.
Year
DOI
Venue
2021
10.1109/TVLSI.2021.3049520
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Keywords
DocType
Volume
Artificial intelligence,authentic operation (AO),deep neural network (DNN),Internet of Things (IoT),workload allocation
Journal
29
Issue
ISSN
Citations 
3
1063-8210
1
PageRank 
References 
Authors
0.35
0
5
Name
Order
Citations
PageRank
Yi-Wen Hung110.35
Yung-Chih Chen241339.89
Chi Lo310.35
Austin Go So410.35
Shih-Chieh Chang564152.31