Title
A multi-relational term scheme for first story detection.
Abstract
First Story Detection (FSD) aims to identify the first story for an emerging event previously unreported, which is essential to practical applications in news analysis, intelligence gathering, and national security. Compared to information retrieval, text clustering, text classification, and other subject-based tasks, FSD is event-based and thus faces the challenging issues of multiple events on the same subject and the evolution of events. To tackle these challenges, several schemes for exploiting temporal information, named entity, and topic modeling, have been proposed for FSD. In this paper, we present a new term weighting scheme called LGT, which jointly models the Local element, Global element, and Topical association of each story. An unsupervised algorithm based on LGT is then devised and applied to FSD. We evaluate 4 feature reduction strategies and test our LGT scheme on an online model. Experiments show that our approach yields better results than existing baseline schemes on both retrospective and online FSD.
Year
DOI
Venue
2017
10.1016/j.neucom.2016.06.089
Neurocomputing
Keywords
Field
DocType
First story detection,Latent Dirichlet allocation,Feature reduction,Synonymous,polysemous
Data mining,Latent Dirichlet allocation,Weighting,Computer science,Document clustering,Artificial intelligence,Online model,Pattern recognition,Global element,Named entity,News analytics,Topic model,Machine learning
Journal
Volume
ISSN
Citations 
254
0925-2312
3
PageRank 
References 
Authors
0.38
31
6
Name
Order
Citations
PageRank
Yanghui Rao125623.32
Qing Li230.38
Qingyuan Wu3202.75
Haoran Xie445071.21
Fu Lee Wang5926118.55
Tao Wang6657.27