Title
Data Transfer Optimization Based on Offline Knowledge Discovery and Adaptive Real-time Sampling.
Abstract
The amount of data moved over dedicated and non-dedicated network links increases much faster than the increase in the network capacity, but the current solutions fail to guarantee even the promised achievable transfer throughputs. In this paper, we propose a novel dynamic throughput optimization model based on mathematical modeling with offline knowledge discovery/analysis and adaptive online decision making. In offline analysis, we mine historical transfer logs to perform knowledge discovery about the transfer characteristics. Online phase uses the discovered knowledge from the offline analysis along with real-time investigation of the network condition to optimize the protocol parameters. As real-time investigation is expensive and provides partial knowledge about the current network status, our model uses historical knowledge about the network and data to reduce the real-time investigation overhead while ensuring near optimal throughput for each transfer. Our network and data agnostic solution is tested over different networks and achieved up to 93% accuracy compared with the optimal achievable throughput possible on those networks.
Year
Venue
Field
2017
arXiv: Distributed, Parallel, and Cluster Computing
Data mining,Data transmission,Computer science,Offline analysis,Sampling (statistics),Online decision making,Knowledge extraction,Throughput,Distributed computing
DocType
Volume
Citations 
Journal
abs/1707.09455
0
PageRank 
References 
Authors
0.34
15
6
Name
Order
Citations
PageRank
M. D. S. Q. Zulkar Nine160.81
Kemal Guner261.49
Ziyun Huang352.49
Xiangyu Wang47623.91
Jinhui Xu566578.86
Kosar, Tevfik661448.67