Title
An empirical study on retrieval models for different document genres: patents and newspaper articles
Abstract
Reflecting the rapid growth in the utilization of large test collections for information retrieval since the 1990s, extensive comparative experiments have been performed to explore the effectiveness of various retrieval models. However, most collections were intended for retrieving newspaper articles and technical abstracts. In this paper, we describe the process of producing a test collection for patent retrieval, the NTCIR-3 Patent Retrieval Collection, which includes two years of Japanese patent applications and 31 topics produced by professional patent searchers. We also report experimental results obtained by using this collection to re-examine the effectiveness of existing retrieval models in the context of patent retrieval. The relative superiority among existing retrieval models did not significantly differ depending on the document genre, that is, patents and newspaper articles. Issues related to patent retrieval are also discussed.
Year
DOI
Venue
2003
10.1145/860435.860482
SIGIR
Keywords
Field
DocType
test collection,large test collection,various retrieval model,patent retrieval,japanese patent application,information retrieval,retrieving newspaper article,newspaper article,empirical study,professional patent searcher,retrieval model,different document genre
Data mining,World Wide Web,Human–computer information retrieval,Information retrieval,Data retrieval,Computer science,Patent retrieval,Newspaper,Relevance (information retrieval),Document retrieval,Empirical research
Conference
ISBN
Citations 
PageRank 
1-58113-646-3
35
2.19
References 
Authors
7
4
Name
Order
Citations
PageRank
Makoto Iwayama143687.03
Atsushi Fujii248659.25
Noriko Kando31474209.89
Yuzo Marukawa4407.39