Abstract | ||
---|---|---|
As one of the main channels through which citizens report urban management issues, the urban hotline collects many complaints and incidents about the city, such as noise nuisance and illegal buildings. However, the topics of urban hotline records have traditionally been analyzed with manual statistical methods, and few automatic analysis models or software tools for topic mining of urban hotline records have been documented. To automatically analyze the massive amount of information in urban hotline records, we propose a semantic-based fast short-text clustering method that groups semantically similar short texts into long texts according to the similarity of their semantic-based keyword sets; the latent Dirichlet allocation (LDA) model is then applied to mine the topic distribution of the urban hotline records. Experiments on 87,055 urban hotline records collected in Chengdu from 2017 to 2018 show that our approach achieves significantly better performance in both accuracy and topic coherence than the LDA method. |
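
The clustering step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the similarity measure, the keyword extractor, or the clustering procedure, so the sketch assumes plain Jaccard overlap between keyword sets as a stand-in for semantic similarity, whitespace tokenization as a stand-in for keyword extraction, and a single-pass greedy assignment with a hypothetical `threshold` parameter. The function names `jaccard`, `cluster_short_texts`, and `merge_into_long_texts` are illustrative only.

```python
from typing import List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two keyword sets (assumed stand-in for semantic similarity)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_short_texts(keyword_sets: List[Set[str]],
                        threshold: float = 0.3) -> List[List[int]]:
    """Single-pass greedy clustering (an assumption, not the paper's procedure).

    Each record joins the first cluster whose accumulated keyword set is
    similar enough; otherwise it starts a new cluster.
    """
    clusters: List[List[int]] = []
    cluster_keywords: List[Set[str]] = []
    for idx, keywords in enumerate(keyword_sets):
        for c, existing in enumerate(cluster_keywords):
            if jaccard(keywords, existing) >= threshold:
                clusters[c].append(idx)
                cluster_keywords[c] = existing | keywords
                break
        else:
            # No sufficiently similar cluster was found.
            clusters.append([idx])
            cluster_keywords.append(set(keywords))
    return clusters


def merge_into_long_texts(records: List[str],
                          clusters: List[List[int]]) -> List[str]:
    """Concatenate the short texts of each cluster into one long pseudo-document."""
    return [" ".join(records[i] for i in members) for members in clusters]


# Toy English records; real hotline records would need proper word
# segmentation and keyword extraction instead of str.split().
records = [
    "loud construction noise at night",
    "construction noise late at night near school",
    "illegal building on rooftop",
    "rooftop illegal building blocks light",
]
keyword_sets = [set(r.split()) for r in records]
clusters = cluster_short_texts(keyword_sets)
long_texts = merge_into_long_texts(records, clusters)
```

The resulting `long_texts` would then serve as the input documents for an LDA topic model, in place of the original short records.
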
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/DASC/PiCom/CBDCom/CyberSciTech.2019.00103 | IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH) |
Keywords | Field | DocType |
LDA, Topic mining, Urban hotline, Text clustering | Semantic similarity, Hotline, Latent Dirichlet allocation, Information retrieval, Computer science, Feature extraction, Software, Cluster analysis, Analysis models, Semantics | Conference |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiaorong Pu | 1 | 85 | 11.17 |
Kun Long | 2 | 0 | 0.34 |
Kecheng Chen | 3 | 0 | 0.34 |
Mei Xie | 4 | 0 | 0.34 |
Jian Cheng Lv | 5 | 337 | 54.52 |
Dezhong Peng | 6 | 5 | 1.14 |