Title
Mining frequent trees based on topology projection
Abstract
Methods for mining frequent trees are widely used in domains like bioinformatics, web-mining, chemical compound structure mining, and so on. In this paper, we present TG, an efficient pattern growth algorithm for mining frequent embedded suttees in a forest of rooted, labeled, and ordered trees. It uses rightmost path expansion scheme to construct complete pattern growth space, and creates a projected database for every grow point of the pattern ready to grow. Then, the problem is transformed from mining frequent trees to finding frequent nodes in the projected database. We conduct detailed experiments to test its performance and scalability and find that TG outperforms TreeMiner, one of the fastest methods proposed before, by a factor of 4 to 15.
Year
DOI
Venue
2005
10.1007/978-3-540-31849-1_39
APWeb
Keywords
Field
DocType
web mining
Data mining,Structure mining,Computer science,Theoretical computer science,Scalability
Conference
Volume
Issue
ISSN
3399
null
0302-9743
ISBN
Citations 
PageRank 
3-540-25207-X
0
0.34
References 
Authors
14
5
Name
Order
Citations
PageRank
Haibing Ma121.08
Wang Chen200.34
Ronglu Li3272.97
Liu Yong400.34
Yunfa Hu57413.44