Abstract | ||
---|---|---|
We develop a framework for representing XML documents and queries in vector spaces and build indexes for processing text-centric semi-structured queries that support a proximity measure between XML documents. The idea of using vector spaces for XML retrieval is not new. In this paper we (i) unify prior approaches into a single framework; (ii) develop techniques to eliminate special purpose auxiliary computations (outside the vector space) used previously; (iii) give experimental evidence on benchmark queries that our approach is competitive in its retrieval quality and (iv) as an immediate consequence of the framework, are able to classify and cluster XML documents. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1007/978-3-540-31865-1_8 | ECIR |
Keywords | Field | DocType |
immediate consequence,single framework,retrieval quality,cluster xml document,experimental evidence,auxiliary computation,vector space,xml retrieval,xml document,benchmark query,encoding xml,indexation | Data mining,XML Encryption,Efficient XML Interchange,XML framework,Streaming XML,Information retrieval,Computer science,XML validation,Document Structure Description,XML schema,XML Schema Editor | Conference |
Volume | ISSN | ISBN |
3408 | 0302-9743 | 3-540-25295-9 |
Citations | PageRank | References |
13 | 0.86 | 25 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Vinay Kakade | 1 | 24 | 1.60 |
Prabhakar Raghavan | 2 | 13351 | 2776.61 |