Title
Evaluation of an associative classifier based on position-constrained frequent/closed subtree mining
Abstract
Tree-structured data are popular in many domains making structural classification an important task. In this paper, an associative classification method is introduced based on a structure preserving flat representation of trees. A major difference to traditional tree mining techniques is that subtrees are constrained by the position in the original trees, leading to a drastic reduction in the number of rules generated, especially with data having great structural variation among tree instances. This characteristic would be desired in the current status of frequent pattern mining, where excessive patterns hinder the practical use of results. However the question remains whether this reduction comes at a high cost in accuracy and coverage rate reduction. We explore this aspect and compare the approach with a state-of-the-art structural classifier based on same subtree type, but not positional constrained in any way. We investigate the effect of using different types of frequent pattern (frequent or closed), or subtree types (induced, embedded or embedded-plus-disconnected subtrees) to the performance of the two classifiers. Different rule strength measures such as confidence, weighted confidence and likelihood are also examined in our study. The experiments on three real-world data sets reveal important similarities and differences between the methods.
Year
DOI
Venue
2015
10.1007/s10844-014-0312-9
Journal of Intelligent Information Systems
Keywords
Field
DocType
Tree mining,Structural classification,Database structure model,Association rule mining
Data mining,Data set,Associative property,Computer science,Associative classifier,Artificial intelligence,Classifier (linguistics),Tree mining,Pattern recognition,Structural classification,Tree (data structure),Association rule learning,Machine learning
Journal
Volume
Issue
ISSN
45
3
0925-9902
Citations 
PageRank 
References 
1
0.36
36
Authors
4
Name
Order
Citations
PageRank
Dang Bach Bui151.84
Fedja Hadzic217515.55
Andrea Tagarelli347552.29
michael hecker410.70