Algorithms for within-cluster searches using inverted files - Citegraph

Paper Info

Title
Algorithms for within-cluster searches using inverted files

Abstract
Information retrieval over clustered document collections has two successive stages: first identifying the best-clusters and then the best-documents in these clusters that are most similar to the user query. In this paper, we assume that an inverted file over the entire document collection is used for the latter stage. We propose and evaluate algorithms for within-cluster searches, i.e., to integrate the best-clusters with the best-documents to obtain the final output including the highest ranked documents only from the best-clusters. Our experiments on a TREC collection including 210,158 documents with several query sets show that an appropriately selected integration algorithm based on the query length and system resources can significantly improve the query evaluation efficiency.

Year	DOI	Venue
2006	10.1007/11902140_74	ISCIS
Keywords	Field	DocType
within-cluster search,final output,information retrieval,trec collection,entire document collection,document collection,user query,query length,inverted file,integration algorithm,query set,query evaluation efficiency	Inverted index,Query optimization,Web search query,Data mining,Query language,Query expansion,Information retrieval,Computer science,Web query classification,Algorithm,Ranking (information retrieval),Document retrieval	Conference
Volume	ISSN	ISBN
4263	0302-9743	3-540-47242-8
Citations	PageRank	References
4	0.50	6
Authors
3

Authors (3 rows)

Cited by (4 rows)

References (6 rows)

Name	Order	Citations	PageRank
Ismail Sengor Altingovde	1	320	29.96
Fazli Can	2	581	94.63
Özgür Ulusoy	3	1250	113.15

1