Title
Chinese Character Embedding Based Semantic Query Algorithm for Semi-structured Corpora
Abstract
Semantic query is a common natural language processing task in many application scenarios. Given an input phrase, the system is expected to return the phrases in a corpus that have the same or similar meanings. Because exact string matching cannot satisfy semantic requirements, especially when the query phrase shares no words with the target phrases, approaches based on word embedding vectors learned by neural networks are widely used, since these vectors encode rich semantic information. However, for a semi-structured corpus with no explicit context, these methods cannot be applied directly and effectively. In this paper, we propose CSQ, a semantic query algorithm based on Chinese character embedding. Our algorithm computes the vectors of larger language units from those of smaller language units, which are computed by classical embedding models. The composition method is adapted to the lack of context, on which current embedding algorithms fundamentally rely. Experiments show the effectiveness of the algorithm on the semantic query task over semi-structured corpora.
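The general idea of composing phrase vectors from character vectors and querying by similarity can be illustrated with a minimal sketch. The abstract does not specify the CSQ composition rule, so the plain averaging used here is an assumption, as are the toy three-dimensional vectors in CHAR_VECTORS and the function names phrase_vector, cosine, and semantic_query. In practice the character vectors would come from a classical embedding model trained on a separate corpus that does have context.

```python
import math

# Hypothetical toy character vectors (assumption: real ones would be learned
# by a classical embedding model on a corpus that has context).
CHAR_VECTORS = {
    "电": [0.9, 0.1, 0.0],
    "脑": [0.8, 0.2, 0.1],
    "计": [0.7, 0.3, 0.0],
    "算": [0.8, 0.1, 0.2],
    "机": [0.9, 0.2, 0.1],
    "苹": [0.0, 0.9, 0.1],
    "果": [0.1, 0.8, 0.2],
}


def phrase_vector(phrase):
    """Compose a phrase vector as the mean of its known character vectors.

    Averaging is an assumed stand-in for the paper's composition method.
    """
    vectors = [CHAR_VECTORS[ch] for ch in phrase if ch in CHAR_VECTORS]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def semantic_query(query, corpus, top_k=1):
    """Return the top_k corpus phrases most similar to the query phrase."""
    q = phrase_vector(query)
    if q is None:
        return []
    scored = []
    for phrase in corpus:
        v = phrase_vector(phrase)
        if v is not None:
            scored.append((cosine(q, v), phrase))
    scored.sort(reverse=True)
    return [phrase for _, phrase in scored[:top_k]]


corpus = ["计算机", "苹果"]
result = semantic_query("电脑", corpus)
print(result)  # ['计算机']
```

The query "电脑" shares no characters with "计算机", so exact string matching would find nothing, while the composed vectors still rank it first.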
Year
2017
DOI
10.1109/BIGCOM.2017.22
Venue
2017 3rd International Conference on Big Data Computing and Communications (BIGCOM)
Keywords
semistructured corpus,CSQ,composition method,semantic query task,semistructured corpora,current embedding algorithms,query phrase,semantic requirements,common natural language process task,semantic query algorithm,Chinese character embedding
Field
Computer science,Phrase,Spelling,Natural language processing,Semantic query,Artificial intelligence,Word embedding,Artificial neural network,Embedding,Information retrieval,Algorithm,Semantic information,Natural language
DocType
Conference
ISBN
978-1-5386-3350-2
Citations
0
PageRank
0.34
References
9
Authors
6
Name,Order,Citations,PageRank
Shengzhe Li,1,0,0.34
Chunhong Zhang,2,14,6.37
xuan zhang,3,93,25.30
Hang Li,4,38,21.94
Ji Yang,5,35,8.74
Xiaofeng Qiu,6,0,1.69