Title
DM-PAS: A Data Mining Prefetching Algorithm for Storage System
Abstract
This paper is motivated by a global online satellite images distribution system operated at the Earth Resources Observation and Science (EROS) center of the U.S Geological Survey. Fundamental objectives of EROS include, but are not limited to, building high-speed and cost-effective massive data processing and storage systems to support online satellite images distribution. Hybrid storage systems--containing solid-state drives (SSD), hard disks (HDD), and tapes--can provide an ideal data storage solution for a wide variety of data processing centers like EROS. Large-scale hybrid storage systems will become increasingly popular in the next few years for the following two reasons. First, highly accessed storage objects in a hybrid storage system can be prefetched and cached to high-speed storage components such as solid-state drives. SSD-based hybrid storage system can provide large storage capacity, high I/O performance and data reliability. Second, hybrid storage systems are cost-effective, because inexpensive tapes help in increasing storage capacities at very low cost. Transferring data back and forth among SSDs, HDDs, and tapes plays a critical role in achieving high I/O performance. Thus, we proposed data mining algorithms that can judiciously prefetch data. Our analytical model and the experimental results reveals that our data mining prefetching algorithm increase the performance of the hybrid storage systems.
Year
DOI
Venue
2011
10.1109/HPCC.2011.71
HPCC
Keywords
Field
DocType
o performance,large-scale hybrid storage system,ideal data storage solution,accessed storage object,storage system,storage capacity,high-speed storage component,ssd-based hybrid storage system,hybrid storage system,data mining prefetching algorithm,large storage capacity,data mining,data storage,cost effectiveness,internet,data processing
Data mining,Converged storage,EMC Invista,Computer science,Cache,Computer data storage,Real-time computing,Distributed computing,Object storage,Data striping,Algorithm,Information repository,Storage area network
Conference
Citations 
PageRank 
References 
0
0.34
21
Authors
5
Name
Order
Citations
PageRank
Mais Nijim114414.08
Yousef Nijim262.18
Remzi Sker300.34
Vamshi Reddy400.34
Raghu Nandi Raju500.34