Title
Short text classification by detecting information path
Abstract
Short text is becoming ubiquitous in many modern information systems. Due to the shortness and sparseness of short texts, there are less informative word co-occurrences among them, which naturally pose great difficulty for classification tasks on such data. To overcome this difficulty, this paper proposes a new way for effectively classifying the short texts. Our method is based on a key observation that there usually exists ordered subsets in short texts, which is termed ``information path'' in this work, and classification on each subset based on the classification results of some pervious subsets can yield higher overall accuracy than classifying the entire data set directly. We propose a method to detect the information path and employ it in short text classification. Different from the state-of-art methods, our method does not require any external knowledge or corpus that usually need careful fine-tuning, which makes our method easier and more robust on different data sets. Experiments on two real world data sets show the effectiveness of the proposed method and its superiority over the existing methods.
Year
DOI
Venue
2013
10.1145/2505515.2505638
CIKM
Keywords
Field
DocType
short text classification,information path,entire data,short text,different data set,classification task,state-of-art method,existing method,classification result
Information system,Data mining,Data set,Information retrieval,Computer science,Artificial intelligence,Machine learning
Conference
Citations 
PageRank 
References 
5
0.44
17
Authors
6
Name
Order
Citations
PageRank
Shitao Zhang150.44
Xiaoming Jin231523.42
Dou Shen3122459.46
Bin Cao457325.94
Xuetao Ding550.44
Xiaochen Zhang650.77