Abstract | ||
---|---|---|
The Twitter microblogging site is one of the most popular websites in the Web today, where millions of users post real-time messages (tweets) on different topics of their interest. The content that becomes popular in Twitter (i.e., discussed by a large number of users) on a certain day can be used for a variety of purposes, including recommendation of popular content and marketing and advertisement campaigns. In this scenario, it would be of great interest to be able to predict what content will become popular topics of discussion in Twitter in the recent future. This problem is very challenging due to the inherent dynamicity in the Twitter system, where topics can become hugely popular within short intervals of time. The Twitter site periodically declares a set of trending topics, which are the keywords (e.g., hashtags) that are at the center of discussion in the Twitter network at a given point of time. However, the exact algorithm that Twitter uses to identify the trending topics at a certain time is not known publicly. In this paper, we aim to predict the keywords (hashtags) that are likely to become trending in Twitter in the recent future. We model this prediction task as a machine learning classification problem, and analyze millions of tweets from the Twitter stream to identify features for distinguishing between trending hashtags and non-trending ones. We train classifiers on features measured over one day, and use the classifiers to distinguish between trending and non-trending hashtags on the next day. The classifiers achieve very high precision and reasonably high recall in identifying the hashtags that are likely to become trending. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1007/978-3-319-20294-5_49 | Lecture Notes in Computer Science |
Keywords | Field | DocType |
Online social network,Twitter,Trending topics,Predicting trends,Machine learning,Classification | Social media,Social network,Exact algorithm,Computer science,Microblogging,Artificial intelligence,Statistical classification,Machine learning | Conference |
Volume | ISSN | Citations |
8947 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Anubrata Das | 1 | 0 | 0.68 |
Moumita Roy | 2 | 50 | 8.71 |
Soumi Dutta | 3 | 0 | 0.68 |
Saptarshi Ghosh | 4 | 594 | 53.82 |
Asit Kumar Das | 5 | 73 | 16.06 |