Title
Automatic targeted-domain spatiotemporal event detection in twitter.
Abstract
Twitter has become an important data source for detecting events, especially tracking detailed information for events of a specific domain. Previous studies on targeted-domain Twitter information extraction have used supervised learning techniques to identify domain-related tweets, however, the need for extensive manual labeling makes these supervised systems extremely expensive to build and maintain. What's more, most of these existing work fail to consider spatiotemporal factors, which are essential attributes of target-domain events. In this paper, we propose a semi-supervised method for Automatical Targeted-domain Spatiotemporal Event Detection (ATSED) in Twitter. Given a targeted domain, ATSED first learns tweet labels from historical data, and then detects on-going events from real-time Twitter data streams. Specifically, an efficient label generation algorithm is proposed to automatically recognize tweet labels from domain-related news articles, a customized classifier is created for Twitter data analysis by utilizing tweets' distinguishing features, and a novel multinomial spatial-scan model is provided to identify geographical locations for detected events. Experiments on 305 million tweets demonstrated the effectiveness of this new approach.
Year
DOI
Venue
2016
10.1007/s10707-016-0263-0
GeoInformatica
Keywords
Field
DocType
Social media,Data mining,Spatiotemporal
Data source,Data mining,Data stream mining,Social media,Computer science,Multinomial distribution,Supervised learning,Information extraction,Classifier (linguistics)
Journal
Volume
Issue
ISSN
20
4
1384-6175
Citations 
PageRank 
References 
7
0.43
19
Authors
5
Name
Order
Citations
PageRank
Ting Hua11025.59
Feng Chen245148.47
Liang Zhao338654.50
Chang-Tien Lu41097115.77
Naren Ramakrishnan51913176.25