Title
A Study of Feature Construction for Text-based Forecasting of Time Series Variables.
Abstract
Time series are ubiquitous in the world since they are used to measure various phenomena (e.g., temperature, spread of a virus, sales, etc.). Forecasting of time series is highly beneficial (and necessary) for optimizing decisions, yet is a very challenging problem; using only the historical values of the time series is often insufficient. In this paper, we study how to construct effective additional features based on related text data for time series forecasting. Besides the commonly used n-gram features, we propose a general strategy for constructing multiple topical features based on the topics discovered by a topic model. We evaluate feature effectiveness using a data set for predicting stock price changes where we constructed additional features from news text articles for stock market prediction. We found that: 1) Text-based features outperform time series-based features, suggesting the great promise of leveraging text data for improving time series forecasting. 2) Topic-based features are not very effective stand-alone, but they can further improve performance when added on top of n-gram features. 3) The best topic-based feature appears to be a long-term aggregation of topics over time with high weights on recent topics.
Year
DOI
Venue
2017
10.1145/3132847.3133109
CIKM
Field
DocType
ISBN
Time series,Data mining,Stock price,Computer science,Artificial intelligence,Deep learning,Topic model,Stock market prediction,Machine learning,Graph (abstract data type)
Conference
978-1-4503-4918-5
Citations 
PageRank 
References 
2
0.37
9
Authors
4
Name
Order
Citations
PageRank
Yiren Wang1123.92
Dominic Seyler294.29
Shubhra Kanti Karmaker Santu3302.55
ChengXiang Zhai411908649.74