Abstract | ||
---|---|---|
Three approaches to content-and-structure XML retrieval are analysed in this paper: rst by using Zettair, a full- text information retrieval system; second by using eXist, a native XML database, and third by using a hybrid XML re- trieval system that uses eXist to produce the nal answers from likely relevant articles retrieved by Zettair. INEX 2003 content-and-structure topics can be classied in two cate- gories: the rst retrieving full articles as nal answers, and the second retrieving more specic elements within articles as nal answers. We show that for both topic categories our initial hybrid system improves the retrieval eectiv eness of a native XML database. For ranking the nal answer elements, we propose and evaluate a novel retrieval model that utilises the structural relationships between the answer elements of a native XML database and retrieves Coherent Retrieval Elements. The nal results of our experiments show that when the XML retrieval task focusses on highly relevant elements our hybrid XML retrieval system with the Coherent Retrieval Elements module is 1.8 times more eec- tive than Zettair and 3 times more eectiv e than eXist, and yields an eectiv e content-and-structure XML retrieval. |
Year | Venue | Keywords |
---|---|---|
2005 | Clinical Orthopaedics and Related Research | xml information retrieval,native xml database,inex,zettair,exist,hybrid system,information retrieval,information retrieval system |
Field | DocType | Volume |
Efficient XML Interchange,Streaming XML,Information retrieval,XML validation,Computer science,Document Structure Description,XML database,XML schema,XML Schema Editor,XML Signature | Journal | abs/cs/0508017 |
Citations | PageRank | References |
1 | 0.38 | 8 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jovan Pehcevski | 1 | 199 | 13.72 |
James A. Thom | 2 | 622 | 182.05 |
Anne-Marie Vercoustre | 3 | 331 | 81.83 |