Title
Inter and intra-document contexts applied in polyrepresentation for best match IR
Abstract
The principle of polyrepresentation offers a theoretical framework for handling multiple contexts in information retrieval (IR). This paper presents an empirical laboratory study of polyrepresentation in restricted mode of the information space with focus on inter and intra-document features. The Cystic Fibrosis test collection indexed in the best match system InQuery constitutes the experimental setting. Overlaps between five functionally and/or cognitively different document representations are identified. Supporting the principle of polyrepresentation, results show that in general overlaps generated by three or four representations of different nature have higher precision than those generated from two representations or the single fields. This result pertains to both structured and unstructured query mode in best match retrieval, however, with the latter query mode demonstrating higher performance. The retrieval overlaps containing search keys from the bibliographic references provide the best retrieval performance and minor MeSH terms the worst. It is concluded that a highly structured query language is necessary when implementing the principle of polyrepresentation in a best match IR system because the principle is inherently Boolean. Finally a re-ranking test shows promising results when search results are re-ranked according to precision obtained in the overlaps whilst re-ranking by citations seems less useful when integrated into polyrepresentative applications.
Year
DOI
Venue
2008
10.1016/j.ipm.2008.05.006
Inf. Process. Manage.
Keywords
Field
DocType
contextual ir,polyrepresentation,cognitive overlaps,best retrieval performance,document structure,best match retrieval,best match system,restricted mode,unstructured query mode,latter query mode,ir system,query language,information retrieval,cystic fibrosis test collection,intra-document context,overlapping generations,structured query language,indexation
SQL,Data mining,Query language,Information retrieval,Computer science,Document Structure Description,Information space
Journal
Volume
Issue
ISSN
44
5
Information Processing and Management
Citations 
PageRank 
References 
21
1.13
21
Authors
3
Name
Order
Citations
PageRank
Mette Skov1473.45
Birger Larsen294455.00
PETER INGWERSEN32192291.28