Title
An application of least squares fit mapping to text information retrieval
Abstract
This paper describes a unique example-based mapping method for document retrieval. We discovered that the knowledge about relevance among queries and documents can be used to obtain empirical connections between query terms and the canonical concepts which are used for indexing the content of documents. These connections do not depend on whether there are shared terms among the queries and documents; therefore, they are especially effective for a mapping from queries to the documents where the concepts are relevant but the terms used by article authors happen to be different from the terms of database users. We employ a Linear Least Squares Fit (LLSF) technique to compute such connections from a collection of queries and documents where the relevance is assigned by humans, and then use these connections in the retrieval of documents where the relevance is unknown. We tested this method on both retrieval and indexing with a set of MEDLINE documents which has been used by other information retrieval systems for evaluations. The effectiveness of the LLSF mapping and the significant improvement over alternative approaches was evident in the tests.
Year
DOI
Venue
1993
10.1145/160688.160738
SIGIR
Keywords
Field
DocType
canonical concept,medline document,information retrieval system,text information retrieval,unique example-based mapping method,llsf mapping,database user,alternative approach,squares fit,article author,document retrieval,aggregation,hypertext,indexation,structural analysis,information retrieval,least square,graph theory,clustering
Data mining,Human–computer information retrieval,Information retrieval,Computer science,Search engine indexing,Ranking (information retrieval),Relevance (information retrieval),Vector space model,Document retrieval,Cluster analysis,Visual Word
Conference
ISBN
Citations 
PageRank 
0-89791-605-0
21
20.65
References 
Authors
3
2
Name
Order
Citations
PageRank
Yiming Yang13299344.91
Christopher G Chute22349282.57