Title
A time-dependent topic model for multiple text streams
Abstract
In recent years social media have become indispensable tools for information dissemination, operating in tandem with traditional media outlets such as newspapers, and it has become critical to understand the interaction between the new and old sources of news. Although social media as well as traditional media have attracted attention from several research communities, most of the prior work has been limited to a single medium. In addition temporal analysis of these sources can provide an understanding of how information spreads and evolves. Modeling temporal dynamics while considering multiple sources is a challenging research problem. In this paper we address the problem of modeling text streams from two news sources - Twitter and Yahoo! News. Our analysis addresses both their individual properties (including temporal dynamics) and their inter-relationships. This work extends standard topic models by allowing each text stream to have both local topics and shared topics. For temporal modeling we associate each topic with a time-dependent function that characterizes its popularity over time. By integrating the two models, we effectively model the temporal dynamics of multiple correlated text streams in a unified framework. We evaluate our model on a large-scale dataset, consisting of text streams from both Twitter and news feeds from Yahoo! News. Besides overcoming the limitations of existing models, we show that our work achieves better perplexity on unseen data and identifies more coherent topics. We also provide analysis of finding real-world events from the topics obtained by our model.
Year
DOI
Venue
2011
10.1145/2020408.2020551
KDD
Keywords
Field
DocType
temporal dynamic,addition temporal analysis,social media,multiple text stream,multiple correlated text stream,time-dependent topic model,text stream,news source,traditional media outlet,traditional media,temporal modeling,analysis address,topic models,news
Data science,Perplexity,Data mining,Social media,Computer science,Popularity,Newspaper,Temporal modeling,Topic model,Information Dissemination,STREAMS
Conference
Citations 
PageRank 
References 
51
1.70
27
Authors
4
Name
Order
Citations
PageRank
Liangjie Hong1131254.89
Byron Dom22600825.93
Siva Gurumurthy341016.99
Kostas Tsioutsiouliklis475235.84