Title
Relaxed global term weights for XML element search
Abstract
XML element search engines return XML elements, which are parts of XML documents, as search results. Existing approaches to XML element search are adapted from information retrieval techniques for document search. Global term weights can be calculated from the statistics of XML elements that share 1) the same path expression or 2) the same tag. In the first approach, the more complex a path expression is, the fewer XML elements share it. As a result, global term weights may be calculated from the statistics of only a few XML elements, and such weights are hardly global. The second approach has the problem that it does not consider the document structure of XML elements. To resolve these problems, we propose a method for calculating accurate global weights. In our method, we regard a path expression as an array of tags. We relax the restrictions on the order and frequency of tags in a path expression so that similar path expressions are gathered into the same class. In this way, we try to decrease the number of classes that contain very few elements. Our experimental results on a certain test collection show that our method can integrate path expressions without decreasing search accuracy.
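The idea of relaxing tag order and tag frequency can be illustrated with a minimal sketch. It is one possible reading of the abstract, not the authors' implementation: the function names, the sample paths, and the use of a plain IDF, log(N / df), as the global weight are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def path_tags(path):
    """Split a path expression such as '/article/sec/p' into its tags."""
    return [tag for tag in path.split("/") if tag]


def strict_key(path):
    """No relaxation: the class is the exact sequence of tags."""
    return tuple(path_tags(path))


def relax_order(path):
    """Hypothetical relaxation: ignore tag order, keep tag frequency."""
    return tuple(sorted(path_tags(path)))


def relax_order_and_frequency(path):
    """Hypothetical relaxation: ignore both tag order and tag frequency."""
    return frozenset(path_tags(path))


def class_idf(elements, key_fn):
    """Group elements into classes and compute a per-class IDF.

    elements: iterable of (path, terms) pairs, one per XML element.
    Returns (sizes, idf), where sizes maps class -> number of elements
    and idf maps class -> {term: log(class size / document frequency)}.
    Plain IDF is an assumption; the abstract does not name a formula.
    """
    sizes = Counter()
    doc_freq = defaultdict(Counter)
    for path, terms in elements:
        key = key_fn(path)
        sizes[key] += 1
        for term in set(terms):
            doc_freq[key][term] += 1
    idf = {
        key: {term: math.log(sizes[key] / n) for term, n in counts.items()}
        for key, counts in doc_freq.items()
    }
    return sizes, idf


# Invented sample data: (path expression, terms in the element).
elements = [
    ("/article/sec/p", {"xml", "search"}),
    ("/article/sec/sec/p", {"xml"}),
    ("/article/p/sec", {"weight"}),
    ("/article/title", {"xml"}),
]

strict_sizes, _ = class_idf(elements, strict_key)
order_sizes, _ = class_idf(elements, relax_order)
relaxed_sizes, relaxed_idf = class_idf(elements, relax_order_and_frequency)
```

In this toy data, each relaxation merges more path expressions into shared classes, so each class has more elements from which to estimate its statistics.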
Year: 2010
DOI: 10.1007/978-3-642-23577-1_6
Venue: INEX
Keywords: xml element search engine, document search, xml element, accurate global weight, path expression, similar path expression, xml element search, global term weight, xml document, global weight
Field: Search engine, XML, Information retrieval, XML validation, Computer science, Path expression, XML schema
DocType: Conference
Volume: 6932
ISSN: 0302-9743
Citations: 2
PageRank: 0.37
References: 9
Authors: 3
Name
Order
Citations
PageRank
Atsushi Keyaki1115.46
Kenji Hatano23010.41
Jun Miyazaki314624.75