Title
Visual Analytics Of Large Multi-Dimensional Data Using Variable Binned Scatter Plots
Abstract
The scatter plot is a well-known method of visualizing pairs of two-dimensional continuous variables. Multi-dimensional data can be depicted in a scatter plot matrix. They are intuitive and easy-to-use, but often have a high degree of overlap which may occlude a significant portion of data. In this paper, we propose variable binned scatter plots to allow the visualization of large amounts of data without overlapping. The basic idea is to use a non-uniform (variable) binning of the x and y dimensions and plots all the data points that fall within each bin into corresponding squares. Further, we map a third attribute to color for visualizing clusters. Analysts are able to interact with individual data points for record level information. We have applied these techniques to solve real-world problems on credit card fraud and data center energy consumption to visualize their data distribution and cause-effect among multiple attributes. A comparison of our methods with two recent well-known variants of scatter plots is included.
Year
DOI
Venue
2010
10.1117/12.840142
VISUALIZATION AND DATA ANALYSIS 2010
Keywords
Field
DocType
Variable Binned, Scatter plots, Correlations, Clusters, Cause-Effect, Data Distribution
Data mining,Bin,Computer science,Visual analytics,Artificial intelligence,Cluster analysis,Data point,Computer vision,Bivariate data,Pattern recognition,Visualization,Statistical graphics,Scatter plot
Conference
Volume
ISSN
Citations 
7530
0277-786X
3
PageRank 
References 
Authors
0.44
4
5
Name
Order
Citations
PageRank
Ming C. Hao130.44
Umeshwar Dayal284522538.92
Ratnesh K. Sharma348353.37
Daniel A. Keim477041141.60
Halldor Janetzko531220.69