Title
Nugget Browser: Visual Subgroup Mining and Statistical Significance Discovery in Multivariate Datasets
Abstract
Discovering interesting patterns in datasets is a very important data mining task. Subgroup patterns are local findings identifying the subgroups of a population with some unusual, unexpected, or deviating distribution of a target attribute. However, this pattern discovery task poses several compelling challenges. First, computational data mining techniques can generally only discover and extract pre-defined patterns. Second, since the extracted patterns are typically multi-dimensional arbitrary-shaped regions, it is very difficult to convey in an easily interpretable manner. Finally, in order to assist analysts in exploring their discoveries and understanding the relationships among patterns, as well as connections between patterns and the underlying data instances, an integrated visualization system is greatly needed. In this paper, we present a novel subgroup pattern extraction and visualization system, called the Nugget Browser, that takes advantage of both data mining methods and interactive visual exploration. The system accepts analysts' mining queries interactively, converts the query results into an understandable form, builds visual representations, and supports navigation and exploration for further analyses.
Year
DOI
Venue
2011
10.1109/IV.2011.21
IV
Keywords
Field
DocType
visualization system,pattern discovery task,data mining method,interesting pattern,multivariate datasets,important data mining task,visual subgroup mining,computational data mining technique,nugget browser,integrated visualization system,novel subgroup pattern extraction,interactive visual exploration,statistical significance,underlying data instance,visual system,data visualization,interactive visualization,statistical analysis,feature extraction,data mining,indexes,space exploration,color,visualization
Data mining,Population,Concept mining,Data stream mining,Information retrieval,Information visualization,Visualization,Computer science,Visual analytics,Interactive visual analysis,K-optimal pattern discovery
Conference
ISSN
Citations 
PageRank 
1550-6037
1
0.35
References 
Authors
19
3
Name
Order
Citations
PageRank
Zhenyu Guo151239.61
Matthew O. Ward21757189.48
Elke A. Rundensteiner34076700.65