Title
A Semantic-Based Short-Text Fast Clustering Method On Hotline Records In Chengdu
Abstract
As one of the main channels through which citizens report urban management issues, the urban hotline collects many problems and incidents in the city, such as noise nuisance and illegal buildings. However, the topics of urban hotline records are still analyzed with traditional manual statistical methods, and few automatic analysis models or software tools for topic mining of urban hotline records have been documented. To analyze the massive amount of urban hotline information automatically, we propose a semantic-based short-text fast clustering method. It merges semantically similar short texts into long texts according to the similarity of their semantic-based keyword sets, and the latent Dirichlet allocation (LDA) model is then applied to mine the topic distribution of the urban hotline records. Experiments on 87,055 urban hotline records collected in Chengdu from 2017 to 2018 show that our approach achieves significantly better performance in both accuracy and topic coherence than the LDA method.
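The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how the semantic-based keyword sets or their similarity are computed, so this sketch assumes plain token sets with a tiny stopword list, Jaccard overlap as the similarity measure, a greedy single-pass assignment, and a hypothetical threshold of 0.3. The function names and sample records are likewise made up for illustration.

```python
from typing import List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two keyword sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_short_texts(keyword_sets: List[Set[str]],
                        threshold: float = 0.3) -> List[List[int]]:
    """Greedy single-pass clustering (an assumed stand-in for the paper's method).

    Each record joins the first cluster whose accumulated keyword set is
    similar enough; otherwise it starts a new cluster.
    """
    clusters: List[List[int]] = []     # record indices per cluster
    cluster_keys: List[Set[str]] = []  # union of member keywords per cluster
    for idx, keys in enumerate(keyword_sets):
        for members, ckeys in zip(clusters, cluster_keys):
            if jaccard(keys, ckeys) >= threshold:
                members.append(idx)
                ckeys |= keys  # grow the cluster's keyword set in place
                break
        else:
            # No existing cluster matched, so open a new one.
            clusters.append([idx])
            cluster_keys.append(set(keys))
    return clusters


# Made-up sample records; real hotline records would need proper
# word segmentation and keyword extraction.
records = [
    "loud construction noise at night",
    "noise from construction site at night",
    "illegal building on the rooftop",
    "rooftop illegal building blocks light",
]
STOPWORDS = {"at", "on", "the", "from"}
keyword_sets = [set(r.split()) - STOPWORDS for r in records]

clusters = cluster_short_texts(keyword_sets)
# Concatenate each cluster's short texts into one long pseudo-document.
long_texts = [" ".join(records[i] for i in members) for members in clusters]
```

Each entry of `long_texts` is one merged pseudo-document. In the pipeline the abstract describes, these long texts, rather than the individual short records, would be the input to an LDA topic model.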
Year
2019
DOI
10.1109/DASC/PiCom/CBDCom/CyberSciTech.2019.00103
Venue
IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH)
Keywords
LDA, Topic mining, Urban hotline, Text clustering
Field
Semantic similarity, Hotline, Latent Dirichlet allocation, Information retrieval, Computer science, Feature extraction, Software, Cluster analysis, Analysis models, Semantics
DocType
Conference
Citations
0
PageRank
0.34
References
0
Authors
6
Name, Order, Citations, PageRank
Xiaorong Pu, 1, 85, 11.17
Kun Long, 2, 0, 0.34
Kecheng Chen, 3, 0, 0.34
Mei Xie, 4, 0, 0.34
Jian Cheng Lv, 5, 337, 54.52
Dezhong Peng, 6, 5, 1.14