Title
Latent semantic analysis for vector space expansion and fuzzy logic-based genetic clustering
Abstract
This paper proposes an improved latent semantic analysis (LSA) model to represent textual document and takes advantage of a fuzzy logic based genetic algorithm (FLGA) for clustering. The standard genetic algorithm (GA) in conventional vector space model is rather difficult to deal with because the high dimensional encoding of GA makes it explore the optimal solution in a complicated space which is prone to cause an overflow problem. The LSA-based corpus model not only reduces the dimensions drastically, but also creates an underlying semantic structure which enhances its ability of distinguishing documents in terms of concepts and indirectly improves the ability of GA for clustering (genetic clustering). A novel FLGA is proposed in conjunction with this semantic model in this study. According to the nature of biological evolution, several fuzzy controllers are given to adaptively adjust and optimize the behaviors of the GA which can effectively prevent the premature convergence to a suboptimum solution. The experiment results show that the fuzzy logic controllers enhance the ability of the GA to explore the global optimum solution, and the utilization of the LSA-based text representation method to FLGA further improves its clustering performance.
Year
DOI
Venue
2010
10.1007/s10115-009-0191-5
Knowl. Inf. Syst.
Keywords
Field
DocType
genetic algorithm,latent semantic analysis,global optimum solution,fuzzy logic controller,lsa-based corpus model,fuzzy logic,clustering · latent semantic analysis · dimensionality reduction · genetic algorithm · fuzzy logic · multidimensional scaling,fuzzy logic-based genetic clustering,fuzzy controller,genetic clustering,clustering performance,semantic model,conventional vector space model,vector space expansion,vector space model,genetics,vector space,multidimensional scaling,premature convergence
Fuzzy clustering,Data mining,Premature convergence,Computer science,Fuzzy logic,Artificial intelligence,Vector space model,Cluster analysis,Latent semantic analysis,Genetic algorithm,Machine learning,Semantic data model
Journal
Volume
Issue
ISSN
22
3
0219-3116
Citations 
PageRank 
References 
6
0.46
26
Authors
2
Name
Order
Citations
PageRank
Wei Song111315.51
Soon Cheol Park219714.78