Title
Finding Core Topics: Topic Extraction with Clustering on Tweet
Abstract
Twitter is one of the most popular microblogging services that lets users post short text called Tweet. Tweet is distinguished from conventional text data in that it is typically composed of short and informal message, and it makes typical text analysis methods do not work well. Accordingly, extracting meaningful topics from tweets brings up new challenges. In this work, we propose a simple and novel method called Core-Topic-based Clustering (CTC), which extracts topics and cluster tweets simultaneously based on the clustering principles: minimizing the inter-cluster similarity and maximizing the intra-cluster similarity. Experimental results show that our method efficiently extracts meaningful topics, and the clustering performance is better than K-means algorithm.
Year
DOI
Venue
2012
10.1109/CGC.2012.120
CGC
Keywords
Field
DocType
typical text analysis method,clustering performance,pattern clustering,intracluster similarity,extracts meaningful topic,novel method,topic extraction,finding core topics,social network,k-means clustering algorithm,twitter,meaningful topic,core-topic-based clustering,clustering principle,microblogging service,inter-cluster similarity,document clustering,intra-cluster similarity,tweet clustering,text analysis method,intercluster similarity,short text,social networking (online),conventional text data
Data mining,Clustering high-dimensional data,Social media,Social network,Information retrieval,Document clustering,Computer science,Microblogging,Consensus clustering,Conceptual clustering,Brown clustering,Cluster analysis
Conference
ISBN
Citations 
PageRank 
978-1-4673-3027-5
5
0.57
References 
Authors
9
5
Name
Order
Citations
PageRank
Sungchul Kim110810.36
Sungho Jeon210423.08
Jin-ha Kim332918.78
Young-Ho Park4467.94
Hwanjo Yu51715114.02