Abstract | ||
---|---|---|
There is a notable characteristic of the data access pattern: 80% I/O requests only access 20% data. This feature brings about the concept of hotspot data, which refer to the data in the most frequent requested area. The access to these hotspot data has direct influence upon the performance of the storage system's applications. Therefore, how to predict hotspot data is a critical research focus in the optimization of the storage system. In this paper, we propose a hotspot data prediction model based on a Zipf-like distribution, which can estimate and dynamically adjust parameters according to the present statistics of I/O access. We classify the hotspot data from every trace, and analyse the prediction rate through the classified hotspot data's characteristic. We synthesize the analysis results in different time granularities and hotspot data prediction queue lengths. Finally, we use block I/O traces to discuss the effectiveness of this model. The discussion and analysis results indicate that this model can predict the hotspot data efficiently. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1080/00207160.2013.804512 | International Journal of Computer Mathematics |
Keywords | Field | DocType |
hotspot data,parameter estimation,i/o workloads,zipf-like distribution,prediction model | Data mining,Computer science,Computer data storage,Queue,Real-time computing,Input/output,Estimation theory,Data access,Hotspot (Wi-Fi),Data prediction | Journal |
Volume | Issue | ISSN |
91 | 3 | 0020-7160 |
Citations | PageRank | References |
0 | 0.34 | 16 |
Authors | ||
6 |