Abstract | ||
---|---|---|
In recent years XML has become one of the most promising ways to define semi-structured data. Data mining techniques devised for detecting interesting patterns from semi-structure data have also grown in popularity, but carrying out such techniques on XML data can be problematic due to its hierarchical structure. Therefore, it has become necessary to transform XML into flattened, path data, so as to enable data mining to be carried out efficiently. However, problems may arise when the XML tree needs to be reconstructed from the traversal path. There are currently many transformation techniques for XML data, many of which take advantage of its tree-like hierarchical structure; but most of these approaches do not allow the XML tree to be reconstructed from the traversal path. In this paper we propose a new approach to the transformation of XML data into path data. The new approach employs a 5 step transformation process along with a new 'Postorder Sequencing' method of traversing the XML tree. The proposed method, on the one hand, can be seen an efficient and effective way of transforming XML data into collections of paths, and on the other hand enables XML trees to be generated from the traversal paths. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1007/978-3-319-69781-9_15 | WEB AND BIG DATA |
Keywords | Field | DocType |
XML, Transformation, XPath, Sequential data mining | Data mining,Efficient XML Interchange,XML framework,Streaming XML,Computer science,XML validation,XML database,Theoretical computer science,XML schema,XML tree,XML Signature | Conference |
Volume | ISSN | Citations |
10612 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ruth McNerlan | 1 | 0 | 0.34 |
Yaxin Bi | 2 | 541 | 47.76 |
Guoze Zhao | 3 | 0 | 1.01 |
Bing Han | 4 | 27 | 9.29 |