Tree Pca For Extracting Dominant Substructures From Labeled Rooted Trees - Citegraph

Paper Info

Title
Tree Pca For Extracting Dominant Substructures From Labeled Rooted Trees

Abstract
We propose novel principal component analysis (PCA) for rooted labeled trees to discover dominant substructures from a collection of trees. The principal components of trees are defined in analogy to the ordinal principal component analysis on numerical vectors. Our methods substantially extend earlier work, in which the input data are restricted to binary trees or rooted unlabeled trees with unique vertex indexing, and the principal components are also restricted to the form of paths. In contrast, our extension allows the input data to accept general rooted labeled trees, and the principal components to have more expressive forms of subtrees instead of paths. For this extension, we can employ the technique of flexible tree matching; various mappings used in tree edit distance algorithms. We design an efficient algorithm using top-down mappings based on our framework, and show the applicability of our algorithm by applying it to extract dominant patterns from a set of glycan structures.

Year	DOI	Venue
2015	10.1007/978-3-319-24282-8_27	DISCOVERY SCIENCE, DS 2015
Field	DocType	Volume
Edit distance,Data mining,Pattern recognition,Vertex (geometry),Ordinal number,Computer science,Binary tree,Search engine indexing,Tree edit distance,Artificial intelligence,Principal component analysis	Conference	9356
ISSN	Citations	PageRank
0302-9743	0	0.34
References	Authors
5	3

Authors (3 rows)

Cited by (0 rows)

References (5 rows)

Name	Order	Citations	PageRank
Tomoya Yamazaki	1	0	0.34
Akihiro Yamamoto	2	135	26.84
Tetsuji Kuboyama	3	140	29.36

1