Title
Attribute-Oriented Induction Using Domain Generalization Graphs
Abstract
Attribute-oriented induction summarizes the information in a relational database by repeatedly replacing specific attribute values with more general concepts according to user-defined concept hierarchies. We show how domain generalization graphs can be constructed from multiple concept hierarchies associated with an attribute, describe how these graphs can be used to control the generalization of a set of attributes, and present the Multi-Attribute Generalization algorithm for attribute-oriented induction using domain generalization graphs. Based upon a generate-and-test approach, the algorithm generates all possible combinations of nodes from the domain generalization graphs associated with the individual attributes, to produce all possible generalized relations for the set of attributes. We rank the interestingness of the resulting generalized relations using measures based upon relative entropy and variance. Our experiments show that these measures provide a basis for analyzing summary data from relational databases. Variance appears more useful because it tends to rank the less complex generalized relations (i.e., those with few attributes and/or few tuples) as more interesting.
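The generate-and-test idea described in the abstract can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' Multi-Attribute Generalization algorithm: the toy relation, the node names, and the helper functions are all hypothetical, each domain generalization graph is simplified to a flat set of nodes (each node maps raw values to a concept), and the variance measure is assumed to be the variance of tuple probabilities around the uniform value 1/m, which may differ from the paper's exact definition.

```python
from collections import Counter
from itertools import product

# Hypothetical toy relation with two attributes: (city, product).
ROWS = [
    ("Regina", "apple"),
    ("Saskatoon", "apple"),
    ("Regina", "carrot"),
    ("Vancouver", "carrot"),
    ("Vancouver", "apple"),
    ("Regina", "apple"),
]

# Simplified stand-ins for domain generalization graphs: each node is a
# function that maps a raw attribute value to a (more general) concept.
CITY_NODES = {
    "city": lambda v: v,
    "province": {"Regina": "SK", "Saskatoon": "SK", "Vancouver": "BC"}.get,
    "any": lambda v: "ANY",
}
PRODUCT_NODES = {
    "product": lambda v: v,
    "category": {"apple": "fruit", "carrot": "vegetable"}.get,
    "any": lambda v: "ANY",
}


def generalize(rows, mappers):
    """Map each attribute through its node and count the merged tuples."""
    return Counter(tuple(m(v) for m, v in zip(mappers, row)) for row in rows)


def variance(counts):
    """Assumed measure: variance of tuple probabilities around 1/m."""
    total = sum(counts.values())
    m = len(counts)
    mean = 1.0 / m
    return sum((c / total - mean) ** 2 for c in counts.values()) / m


def all_generalizations(rows, dggs):
    """Generate one generalized relation per combination of nodes."""
    results = []
    for combo in product(*(d.items() for d in dggs)):
        names = tuple(name for name, _ in combo)
        mappers = [fn for _, fn in combo]
        counts = generalize(rows, mappers)
        results.append((names, counts, variance(counts)))
    return results


# Rank the generalized relations from highest to lowest variance.
ranked = sorted(
    all_generalizations(ROWS, [CITY_NODES, PRODUCT_NODES]),
    key=lambda r: r[2],
    reverse=True,
)
```

With three nodes per attribute, the sketch enumerates nine generalized relations. A relative-entropy ranking could be substituted by replacing the `variance` function with a Kullback–Leibler divergence against the uniform distribution.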
Year
1996
DOI
10.1109/TAI.1996.560458
Venue
ICTAI
Keywords
general concept, multi-attribute generalization algorithm, complex generalized relation, attribute-oriented induction, possible generalized relation, multiple concept, domain generalization graph, individual attribute, generalized relation, domain generalization graphs, possible combination, generic algorithm, relational database, data mining, relational databases, machine learning, graph theory
Field
Graph theory, Superkey, Relational calculus, Relational database, Computer science, Tuple, Artificial intelligence, Relational model, Machine learning, Kullback–Leibler divergence, Attribute domain
DocType
Conference
ISSN
1082-3409
ISBN
0-8186-7686-8
Citations
23
PageRank
1.84
References
9
Authors
3
Name                 Order  Citations  PageRank
Howard J. Hamilton   1      1501       145.55
Robert J. Hilderman  2      270        29.86
Nick Cercone         3      1999       570.62