Abstract | ||
---|---|---|
We present cluster-based retrieval (CBR) experiments on the largest available Turkish document collection. Our experiments evaluate retrieval effectiveness and efficiency on both an automatically generated clustering structure and a manual classification of documents. In particular, we compare CBR effectiveness with full-text search (FS) and evaluate several implementation alternatives for CBR. Our findings reveal that CBR yields comparable effectiveness figures with FS. Furthermore, by using a specifically tailored cluster-skipping inverted index we significantly improve in-memory query processing efficiency of CBR in comparison to other traditional CBR techniques and even FS. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1145/1277741.1277961 | SIGIR |
Keywords | Field | DocType |
implementation alternative,in-memory query processing efficiency,cbr effectiveness,comparable effectiveness figure,retrieval effectiveness,turkish text,full-text search,traditional cbr technique,cbr yield,cluster-based retrieval,large-scale cluster-based retrieval experiment,clustering structure,inverted index | Inverted index,Data mining,Turkish,Information retrieval,Computer science,Cluster analysis | Conference |
Citations | PageRank | References |
6 | 0.50 | 5 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ismail Sengor Altingovde | 1 | 320 | 29.96 |
Rifat Ozcan | 2 | 192 | 12.83 |
Huseyin Cagdas Ocalan | 3 | 51 | 2.85 |
Fazli Can | 4 | 581 | 94.63 |
Özgür Ulusoy | 5 | 1250 | 113.15 |