Abstract | ||
---|---|---|
Policy text contains large amount of diversified data and strictly conforms to standards and specifications, but the traditional text clustering method cannot solve the problems of high dimensionality, sparse features, and similar meanings, so this paper proposes a weighted algorithm based on the LDA-Gibbs model to improve the accuracy of policy text clustering. Firstly, it provides realistic basis for the assumptions of the LDA-Gibbs topic model and the weighted algorithm; secondly, it pre-processes the existing policy text simulated data, establishes the LDA-Gibbs model, forms a weighted algorithm, and generates training data to determine the number of optimal topics in the LDA-Gibbs model and completes the final clustering of the policy text; finally, by summarizing, classifying and deducing the conclusions of the experimental data, this paper proves the objective validity and effects of this method. Hopefully the overall design of this method can be applied in the prospective study on the formulation of new policies in the future, the retrospective evaluation and testing of the existing policies and the formation of a two-way interactive mechanism. |
Year | DOI | Venue |
---|---|---|
2019 | 10.20965/jaciii.2019.p0268 | JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS |
Keywords | Field | DocType |
LDA-Gibbs,topic model,text clustering,weighted algorithm | Computer science,Document clustering,Artificial intelligence,Topic model,Machine learning | Journal |
Volume | Issue | ISSN |
23 | 2 | 1343-0130 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
2 |