An improved method for semantic similarity calculation based on stop-words - Citegraph

Paper Info

Title
An improved method for semantic similarity calculation based on stop-words

Abstract
Text similarity calculation has become one of the key issues of many applications such as information retrieval, semantic disambiguation, automatic question answering. There are increasing needs of similarity calculations in different levels, e.g. characters, vocabularies, syntactic structures and semantic etc. Most of existing semantic similarity algorithms can be categorized into statistical based methods, rule based methods and combination of these two methods. Statistical methods use knowledge bases to incorporate more comprehensive knowledge and have the capability of reducing knowledge noise. So they are able to obtain better performance. Nevertheless, for the unbalanced distribution of different items in the knowledge base, semantic similarity calculation performance for low-frequency words is usually poor. In this work, based on the distributions of stop-words, we proposes a weights normalization method for semantic dimensions. The proposed method uses the semantic independence of stop-words to avoid semantic bias of corpus in statistical methods. It further improves the accuracy of semantic similarity computation. Experiments compared with several existing algorithms show the effectiveness of the proposed method. © Springer-Verlag Berlin Heidelberg 2014.

Year	DOI	Venue
2014	10.1007/978-3-662-45652-1_34	Communications in Computer and Information Science
Keywords	Field	DocType
ESA,Semantic dimension normalization,Semantic similarity,Stop-words	Semantic similarity,Rule-based system,Normalization (statistics),Question answering,Information retrieval,Computer science,Knowledge base,Syntax,Stop words,Computation	Conference
Volume	ISSN	ISBN
481	18650929	9783662456514
Citations	PageRank	References
0	0.34	7
Authors
3

Authors (3 rows)

Cited by (0 rows)

References (7 rows)

Name	Order	Citations	PageRank
Li Haodi	1	25	3.89
Qingcai Chen	2	809	66.72
Xiaolong Wang	3	1208	115.39

1