Title
A study of hotspot data prediction model in I/O workloads
Abstract
There is a notable characteristic of the data access pattern: 80% I/O requests only access 20% data. This feature brings about the concept of hotspot data, which refer to the data in the most frequent requested area. The access to these hotspot data has direct influence upon the performance of the storage system's applications. Therefore, how to predict hotspot data is a critical research focus in the optimization of the storage system. In this paper, we propose a hotspot data prediction model based on a Zipf-like distribution, which can estimate and dynamically adjust parameters according to the present statistics of I/O access. We classify the hotspot data from every trace, and analyse the prediction rate through the classified hotspot data's characteristic. We synthesize the analysis results in different time granularities and hotspot data prediction queue lengths. Finally, we use block I/O traces to discuss the effectiveness of this model. The discussion and analysis results indicate that this model can predict the hotspot data efficiently.
Year
DOI
Venue
2014
10.1080/00207160.2013.804512
International Journal of Computer Mathematics
Keywords
Field
DocType
hotspot data,parameter estimation,i/o workloads,zipf-like distribution,prediction model
Data mining,Computer science,Computer data storage,Queue,Real-time computing,Input/output,Estimation theory,Data access,Hotspot (Wi-Fi),Data prediction
Journal
Volume
Issue
ISSN
91
3
0020-7160
Citations 
PageRank 
References 
0
0.34
16
Authors
6
Name
Order
Citations
PageRank
Yin Yang121.38
Zhihu Tan263.62
Changsheng Xie336666.54
Wei Liang400.34
Jie Yu500.34
Jian He610.69