Title
Extracting News Events from Microblogs.
Abstract
Twitter stream has become a large source of information, but the magnitude of tweets posted and the noisy nature of its content makes harvesting of knowledge from Twitter has challenged researchers for long time. Aiming at overcoming some of the main challenges of extracting hidden information from tweet streams, this work proposes a new approach for real-time detection of news events from the Twitter stream. We divide our approach into three steps. The first step is to use a neural network or deep learning to detect news-relevant tweets from the stream. The second step is to apply a novel streaming data clustering algorithm to the detected news tweets to form news events. The third and final step is to rank the detected events based on the size of the event clusters and growth speed of the tweet frequencies. We evaluate the proposed system on a large, publicly available corpus of annotated news events from Twitter. As part of the evaluation, we compare our approach with a related state-of-theart solution. Overall, our experiments and user-based evaluation show that our approach on detecting current (real) news events delivers a state-of-the-art performance.
Year
DOI
Venue
2018
10.1080/09720510.2018.1486273
JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS
Keywords
Field
DocType
Text mining,Deep Learning,Word Embedding,Information Extraction,Event Detection,Social Media
Social media,Information retrieval,Computer science,Microblogging,Information extraction,Artificial intelligence,Word embedding,Deep learning
Journal
Volume
Issue
ISSN
21.0
4
0972-0510
Citations 
PageRank 
References 
0
0.34
0
Authors
2
Name
Order
Citations
PageRank
Øystein Repp100.34
Heri Ramampiaro215420.46