Title
Index and Search XML Documents by Combining Content and Structure
Abstract
By nesting data, XML format allows embedding additional semantic which is not possible using flat text for mat. Obviously, capturing this semantic will enhance the effectiveness of the searching process in an XML corpus. Many approaches address the XML searching problem. Approaches stemmed from database communities are concentrated on the data structure. In this case, users have to express their information need in a complex query language. Moreover, users must have a good knowledge of the document structure. Approaches using information retrieval techniques are concentrated on document content. In this case the search results are not effective because the loss of the semantic conducted by the structure. This paper presents an XML retrieval system within the reach of both expert and naive users. When indexing an XML document, the system takes into account both the document content and the document structure. To query the system, a user can issue both simple queries, i.e. a bag of keywords and more complex queries using boolean operators and operators referring to document structure.
Year
Venue
Keywords
2006
International Conference on Internet Computing
: xml retrieval,information retrieval.,structured search,data structure,document structure,information need,information retrieval,xml document,query language,indexation
Field
DocType
Citations 
XML framework,Streaming XML,Well-formed document,Information retrieval,Computer science,XML validation,Document Structure Description,XML schema,Simple API for XML,Document type definition
Conference
0
PageRank 
References 
Authors
0.34
9
3
Name
Order
Citations
PageRank
Faiza Abbaci1393.61
jeanbaptiste valsamis210.72
Pascal Francq3947.59