Title
Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model.
Abstract
Policy text contains large amount of diversified data and strictly conforms to standards and specifications, but the traditional text clustering method cannot solve the problems of high dimensionality, sparse features, and similar meanings, so this paper proposes a weighted algorithm based on the LDA-Gibbs model to improve the accuracy of policy text clustering. Firstly, it provides realistic basis for the assumptions of the LDA-Gibbs topic model and the weighted algorithm; secondly, it pre-processes the existing policy text simulated data, establishes the LDA-Gibbs model, forms a weighted algorithm, and generates training data to determine the number of optimal topics in the LDA-Gibbs model and completes the final clustering of the policy text; finally, by summarizing, classifying and deducing the conclusions of the experimental data, this paper proves the objective validity and effects of this method. Hopefully the overall design of this method can be applied in the prospective study on the formulation of new policies in the future, the retrospective evaluation and testing of the existing policies and the formation of a two-way interactive mechanism.
Year
DOI
Venue
2019
10.20965/jaciii.2019.p0268
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS
Keywords
Field
DocType
LDA-Gibbs,topic model,text clustering,weighted algorithm
Computer science,Document clustering,Artificial intelligence,Topic model,Machine learning
Journal
Volume
Issue
ISSN
23
2
1343-0130
Citations 
PageRank 
References 
0
0.34
0
Authors
2
Name
Order
Citations
PageRank
Haiqun Ma100.34
Tao Zhang222069.03