Title
A classification approach to Boolean query reformulation
Abstract
One of the difficulties in using current Boolean-based information retrieval systems is that it is hard for a user, especially a novice, to formulate an effective Boolean query. Query reformulation can be even more difficult and complex than formulation since users often have difficulty incorporating the new information gained from the previous search into the next query. In this article, query reformulation is viewed as a classification problem, that is, classifying documents as either relevant or nonrelevant. A new reformulation algorithm is proposed which builds a tree-structured classifier, called a query tree, at each reformulation from a set of feedback documents retrieved from the previous search. The query tree can easily be transformed into a Boolean query. The query tree is compared to two query reformulation algorithms on benchmark test sets (CACM, CISI, and Medlars). In most experiments, the query tree showed significant improvements in precision over the two algorithms compared in this study, We attribute this improved performance to the ability of the query tree algorithm to select good search terms and to represent the relationships among search terms into a tree structure.
Year
DOI
Venue
1997
3.3.CO;2-H" target="_self" class="small-link-text"10.1002/(SICI)1097-4571(199708)48:83.3.CO;2-H
JASIS
Field
DocType
Volume
Query optimization,Data mining,Web search query,Query language,RDF query language,Query expansion,Information retrieval,Computer science,Sargable,Web query classification,Ranking (information retrieval)
Journal
48
Issue
ISSN
Citations 
8
0002-8231
11
PageRank 
References 
Authors
1.19
2
3
Name
Order
Citations
PageRank
James C. French1981386.44
Donald E. Brown2640188.70
Nam-ho Kim3163.27