Title
Short Text Similarity Calculation Using Semantic Information
Abstract
Text similarity is one of the important methods of text data analysis, which is often used in text clustering and classification. Social media is a new and popular online social application that contains a lot of valuable information. Short text is common in social media, and short text similarity is often used for social media data mining. The similarity calculation of short text is influenced by the small feature of text words and the accuracy is low. so it is a common improvement method to calculate the similarity of short texts with word semantic similarity. This paper put forward a short text semantic similarity calculation method that combine knowledge-based method and corpus-based method. This method is based on the improved word semantic similarity calculation method and general short text semantic similarity calculation method. The word similarity calculation method combines two word semantic similarity by some strategies. It takes the advantages of two methods to overcome the disadvantages of single one, finds out more semantic association among words in texts, and improves accuracy of word similarity calculation. This paper uses a large number of corpus to compare and analyze several word and text semantic similarity algorithms, the improved method has a closer result to human ratings than other methods in both word and text similarity.
Year
DOI
Venue
2017
10.1109/BIGCOM.2017.53
2017 3rd International Conference on Big Data Computing and Communications (BIGCOM)
Keywords
Field
DocType
short text,semantic similarity,knowledge-based,corpus-based
Semantic similarity,Social media,Information retrieval,Semantic association,Document clustering,Computer science,Explicit semantic analysis,Semantic information,Natural language processing,Artificial intelligence
Conference
ISBN
Citations 
PageRank 
978-1-5386-3350-2
1
0.39
References 
Authors
4
6
Name
Order
Citations
PageRank
Haoyu Pu110.39
Gaolei Fei211.07
Hailin Zhao310.73
Guang-min Hu48719.78
Chengbo Jiao510.39
Zhoujun Xu6142.69