Title
Hard Drive Failure Prediction Using Big Data
Abstract
We design a general framework named Hdoctor for hard drive failure prediction. Hdoctor leverages the power of big data to achieve a significant improvement comparing to all previous researches that used sophisticated machine learning algorithms. Hdoctor exhibits a series of engineering innovations: (1) constructing time dependent features to characterize the Self-Monitoring, Analysis and Reporting Technology (SMART) value transitions during disk failures, (2) combining features to enable the model to learn the correlation among different SMART attributes, (3) regarding circumstance data such as cluster workload, temperature, humidity, location as related features. Meanwhile, Hdoctor collects/labels samples and updates model automatically, and works well for all kinds of disk failure prediction in our intelligent data center. In this work, we use Hdoctor to collect 74,477,717 training records from our clusters involving 220,022 disks. By training a simple and scalable model, our system achieves a detection rate of 97.82%, with a false alarm rate (FAR) of 0.3%, which hugely outperforms all previous algorithms. In addition, Hdoctor is an excellent indicator for how to predict different hardware failures efficiently under various circumstances.
Year
DOI
Venue
2015
10.1109/SRDSW.2015.15
SRDS Workshop
Keywords
DocType
ISSN
hard drive failure prediction,Big Data,Hdoctor framework,self-monitoring analysis and reporting technology SMART,false alarm rate,FAR
Conference
1060-9857
Citations 
PageRank 
References 
5
0.45
7
Authors
5
Name
Order
Citations
PageRank
Wenjun Yang1111.63
Dianming Hu250.45
Yuliang Liu350.45
Shuhao Wang4202.54
Tianming Jiang550.79