Abstract | ||
---|---|---|
Twitter is one of the most popular microblogging services that lets users post short text called Tweet. Tweet is distinguished from conventional text data in that it is typically composed of short and informal message, and it makes typical text analysis methods do not work well. Accordingly, extracting meaningful topics from tweets brings up new challenges. In this work, we propose a simple and novel method called Core-Topic-based Clustering (CTC), which extracts topics and cluster tweets simultaneously based on the clustering principles: minimizing the inter-cluster similarity and maximizing the intra-cluster similarity. Experimental results show that our method efficiently extracts meaningful topics, and the clustering performance is better than K-means algorithm. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/CGC.2012.120 | CGC |
Keywords | Field | DocType |
typical text analysis method,clustering performance,pattern clustering,intracluster similarity,extracts meaningful topic,novel method,topic extraction,finding core topics,social network,k-means clustering algorithm,twitter,meaningful topic,core-topic-based clustering,clustering principle,microblogging service,inter-cluster similarity,document clustering,intra-cluster similarity,tweet clustering,text analysis method,intercluster similarity,short text,social networking (online),conventional text data | Data mining,Clustering high-dimensional data,Social media,Social network,Information retrieval,Document clustering,Computer science,Microblogging,Consensus clustering,Conceptual clustering,Brown clustering,Cluster analysis | Conference |
ISBN | Citations | PageRank |
978-1-4673-3027-5 | 5 | 0.57 |
References | Authors | |
9 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sungchul Kim | 1 | 108 | 10.36 |
Sungho Jeon | 2 | 104 | 23.08 |
Jin-ha Kim | 3 | 329 | 18.78 |
Young-Ho Park | 4 | 46 | 7.94 |
Hwanjo Yu | 5 | 1715 | 114.02 |