Title
A Scalable Approach Based On Deep Learning For Big Data Time Series Forecasting
Abstract
This paper presents a method based on deep learning to deal with big data times series forecasting. The deep feed forward neural network provided by the H2O big data analysis framework has been used along with the Apache Spark platform for distributed computing. Since H2O does not allow the conduction of multi-step regression, a general-purpose methodology that can be used for prediction horizons with arbitrary length is proposed here, being the prediction horizon, h, the number of future values to be predicted. The solution consists in splitting the problem into h forecasting subproblems, being h the number of samples to be simultaneously predicted. Thus, the best prediction model for each subproblem can be obtained, making easier its parallelization and adaptation to the big data context. Moreover, a grid search is carried out to obtain the optimal hyperparameters of the deep learning-based approach. Results from a real-world dataset composed of electricity consumption in Spain, with a ten-minute frequency sampling rate, from 2007 to 2016 are reported. In particular, the accuracy and runtimes versus computing resources and size of the dataset are analyzed. Finally, the performance and the scalability of the proposed method is compared to other recently published techniques, showing to be a suitable method to process big data time series.
Year
DOI
Venue
2018
10.3233/ICA-180580
INTEGRATED COMPUTER-AIDED ENGINEERING
Keywords
Field
DocType
Deep learning, time series forecasting, big data
Time series,Computer science,Artificial intelligence,Deep learning,Big data,Machine learning,Scalability
Journal
Volume
Issue
ISSN
25
4
1069-2509
Citations 
PageRank 
References 
5
0.42
34
Authors
4
Name
Order
Citations
PageRank
José F. Torres1242.46
Antonio Galicia2181.70
Alicia Troncoso Lora311712.72
Francisco Martínez-Álvarez415523.98