Title
What Is Your Tweet Worldview? Mapping The Topic Structure Of Tweets On The Wikipedia
Abstract
Purpose This paper aims to propose a method for summarizing the topics of tweets using the Wikipedia category structure as common knowledge for supplementing the understanding of the Twitter user's interests. There are many topics in the tweets, and the topics can be treated as a tree structure. However, when the topic hierarchy is constructed using existing hierarchal clustering approach, the granularity of tweet groups differs for each user. For summarizing the topics, identification of the topics which are heterogeneous and which are not is necessary because it is easy to understand if several groups are categorized into parent groups. However, if the group units are different for each user, a number of users' interests cannot be summarized. If some tweets are grouped into the presidential election, and the others are into Donald Trump, there cannot be a count of how many users are interested in Donald Trump.Design/methodology/approach One solution of this issue is to construct topic structures by mapping one common tree structure. In this paper, a method is proposed for constructing the topic structure using the Wikipedia category tree similar to a common tree structure. The tweets are categorized, mapped to titles of articles in the Wikipedia category tree and then visualized as the hierarchal structure to the users.Findings The effectiveness of the proposed hierarchal topic structure is confirmed. In theme politics, the proposed method works well. The main reason is that there are many technical terms about politics in the Wikipedia categories and articles. It was found that a number of the terms of politics do not have multiple meanings, multiple semantics. However, in theme sports, the proposed method does not perform well. The main reason for this case is that there are a number of names of people present as topic names.Originality/value One important feature of the proposed method is that it is easy to grasp not only about the topics which are heterogeneous or homogeneous with each other but also consider the missing time when extracting topics. Another feature is that the topic structures for multiple users are easy to compare with each other.
Year
DOI
Venue
2018
10.1108/IJPCC-D-18-00008
INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS
Keywords
Field
DocType
Twitter, Browsing time, Uninformed information
Topic structure,GRASP,Information retrieval,Computer science,Originality,Common knowledge,Tree structure,Cluster analysis,Hierarchy,Semantics,Distributed computing
Journal
Volume
Issue
ISSN
14
1
1742-7371
Citations 
PageRank 
References 
0
0.34
9
Authors
3
Name
Order
Citations
PageRank
Yu Suzuki144.21
Hiromitsu Ohara200.68
Akiyo Nadamoto318934.24