Title
Large-scale cluster-based retrieval experiments on Turkish texts
Abstract
We present cluster-based retrieval (CBR) experiments on the largest available Turkish document collection. Our experiments evaluate retrieval effectiveness and efficiency on both an automatically generated clustering structure and a manual classification of documents. In particular, we compare CBR effectiveness with full-text search (FS) and evaluate several implementation alternatives for CBR. Our findings reveal that CBR yields comparable effectiveness figures with FS. Furthermore, by using a specifically tailored cluster-skipping inverted index we significantly improve in-memory query processing efficiency of CBR in comparison to other traditional CBR techniques and even FS.
Year
DOI
Venue
2007
10.1145/1277741.1277961
SIGIR
Keywords
Field
DocType
implementation alternative,in-memory query processing efficiency,cbr effectiveness,comparable effectiveness figure,retrieval effectiveness,turkish text,full-text search,traditional cbr technique,cbr yield,cluster-based retrieval,large-scale cluster-based retrieval experiment,clustering structure,inverted index
Inverted index,Data mining,Turkish,Information retrieval,Computer science,Cluster analysis
Conference
Citations 
PageRank 
References 
6
0.50
5
Authors
5
Name
Order
Citations
PageRank
Ismail Sengor Altingovde132029.96
Rifat Ozcan219212.83
Huseyin Cagdas Ocalan3512.85
Fazli Can458194.63
Özgür Ulusoy51250113.15