Title
Transformation of XML Data Sources for Sequential Path Mining
Abstract
In recent years, XML has become one of the most promising ways to define semi-structured data. Data mining techniques devised for detecting interesting patterns in semi-structured data have also grown in popularity, but applying such techniques to XML data can be problematic because of its hierarchical structure. It has therefore become necessary to transform XML into flattened path data so that data mining can be carried out efficiently. However, problems may arise when the XML tree needs to be reconstructed from the traversal path. There are currently many transformation techniques for XML data, many of which take advantage of its tree-like hierarchical structure, but most of these approaches do not allow the XML tree to be reconstructed from the traversal path. In this paper we propose a new approach to the transformation of XML data into path data. The new approach employs a five-step transformation process along with a new 'Postorder Sequencing' method of traversing the XML tree. The proposed method can be seen, on the one hand, as an efficient and effective way of transforming XML data into collections of paths and, on the other hand, as a way of enabling XML trees to be regenerated from the traversal paths.
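To make the idea of flattening an XML tree into path data concrete, the following is a minimal sketch of a generic postorder traversal that emits one root-to-node path per element. It is not the paper's five-step process or its 'Postorder Sequencing' method, whose details the abstract does not give; the function name `postorder_paths` and the sample document are assumptions made purely for illustration.

```python
import xml.etree.ElementTree as ET


def postorder_paths(xml_text):
    """Return one root-to-node tag path per element, in postorder.

    Hypothetical helper for illustration only; not the authors' method.
    """
    root = ET.fromstring(xml_text)
    paths = []

    def visit(node, prefix):
        path = prefix + [node.tag]
        # Postorder: visit every child before recording the node itself.
        for child in node:
            visit(child, path)
        paths.append("/".join(path))

    visit(root, [])
    return paths


# Assumed sample document, chosen only to show the ordering.
doc = "<book><title/><author><name/></author></book>"
for p in postorder_paths(doc):
    print(p)
```

For the sample document, the leaf paths `book/title` and `book/author/name` are emitted before their ancestors `book/author` and `book`. Because this sketch records tag names only, repeated sibling tags would yield identical paths, so it makes no attempt at the tree reconstruction the abstract describes.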
Year: 2016
DOI: 10.1007/978-3-319-69781-9_15
Venue: WEB AND BIG DATA
Keywords: XML, Transformation, XPath, Sequential data mining
Field: Data mining, Efficient XML Interchange, XML framework, Streaming XML, Computer science, XML validation, XML database, Theoretical computer science, XML schema, XML tree, XML Signature
DocType: Conference
Volume: 10612
ISSN: 0302-9743
Citations: 0
PageRank: 0.34
References: 0
Authors: 4
Name            Order  Citations  PageRank
Ruth McNerlan   1      0          0.34
Yaxin Bi        2      541        47.76
Guoze Zhao      3      0          1.01
Bing Han        4      27         9.29