Title
QMatch - Using paths to match XML schemas
Abstract
Integration of multiple heterogeneous data sources continues to be a critical problem for many application domains and a challenge for researchers world-wide. With the increasing popularity of the XML model and the proliferation of XML documents on-line, automated matching of XML documents and databases has become a critical issue. In this paper, we present a hybrid schema match algorithm, QMatch, that provides a unique path-based framework for harnessing traditional structural and semantic information, while exploiting the constraints inherent in XML documents such as the order of XML elements, to provide improved levels of matching between two given XML schemata. QMatch is based on the measurement of a unique quality of match metric, QoM, and a set of classifiers which together provide not only an effective basis for the development of a new schema match algorithm, but also a useful tool for tuning existing schema match algorithms to output at desired levels of matching. In this paper, we show via a set of experiments the benefits of the path-based QMatch over existing structural, linguistic, and hybrid algorithms such as Cupid, and provide an empirical measure of the accuracy of QMatch in terms of the true matches discovered by the algorithm.
Year
2007
DOI
10.1016/j.datak.2006.03.002
Venue
Data Knowl. Eng.
Keywords
true match, xml element, xml model, hybrid schema matching, xml schema, schema matching, xml schema matching, hybrid schema match algorithm, path-based qmatch, existing schema match algorithm, schema integration, match metric, new schema match algorithm, xml document
Field
Data mining, XML Encryption, Efficient XML Interchange, Streaming XML, Information retrieval, XML validation, Computer science, Document Structure Description, RELAX NG, XML schema, Database, XML Schema Editor
DocType
Journal
Volume
60
Issue
2
ISSN
0169-023X
Citations
13
PageRank
0.54
References
17
Authors
2
Name                Order  Citations  PageRank
Naiyana Tansalarak  1      28         2.34
Kajal T. Claypool   2      580        64.35