Title
Enhanced answer type inference from questions using sequential models
Abstract
Question classification is an important step in factual question answering (QA) and other dialog systems. Several attempts have been made to apply statistical machine learning approaches, including Support Vector Machines (SVMs) with sophisticated features and kernels. Curiously, the payoff beyond a simple bag-of-words representation has been small. We show that most questions reveal their class through a short contiguous token subsequence, which we call its informer span. Perfect knowledge of informer spans can enhance accuracy from 79.4% to 88% using linear SVMs on standard benchmarks. In contrast, standard heuristics based on shallow pattern-matching give only a 3% improvement, showing that the notion of an informer is non-trivial. Using a novel multi-resolution encoding of the question's parse tree, we induce a Conditional Random Field (CRF) to identify informer spans with about 85% accuracy. Then we build a meta-classifier using a linear SVM on the CRF output, enhancing accuracy to 86.2%, which is better than all published numbers.
Year
DOI
Venue
2005
10.3115/1220575.1220615
HLT/EMNLP
Keywords
DocType
Volume
linear svms,informer span,support vector machines,question classification,enhanced answer type inference,standard benchmarks,linear svm,crf output,standard heuristics,factual question answering,conditional random field,sequential model,pattern matching,question answering,type inference,support vector machine,bag of words
Conference
H05-1
Citations 
PageRank 
References 
29
1.60
15
Authors
3
Name
Order
Citations
PageRank
Vijay Krishnan119311.34
Sujatha Das2836.26
S. Chakrabarti34703999.55