Visual Analytics Of Large Multi-Dimensional Data Using Variable Binned Scatter Plots - Citegraph

Paper Info

Title
Visual Analytics Of Large Multi-Dimensional Data Using Variable Binned Scatter Plots

Abstract
The scatter plot is a well-known method of visualizing pairs of two-dimensional continuous variables. Multi-dimensional data can be depicted in a scatter plot matrix. They are intuitive and easy-to-use, but often have a high degree of overlap which may occlude a significant portion of data. In this paper, we propose variable binned scatter plots to allow the visualization of large amounts of data without overlapping. The basic idea is to use a non-uniform (variable) binning of the x and y dimensions and plots all the data points that fall within each bin into corresponding squares. Further, we map a third attribute to color for visualizing clusters. Analysts are able to interact with individual data points for record level information. We have applied these techniques to solve real-world problems on credit card fraud and data center energy consumption to visualize their data distribution and cause-effect among multiple attributes. A comparison of our methods with two recent well-known variants of scatter plots is included.

Year	DOI	Venue
2010	10.1117/12.840142	VISUALIZATION AND DATA ANALYSIS 2010
Keywords	Field	DocType
Variable Binned, Scatter plots, Correlations, Clusters, Cause-Effect, Data Distribution	Data mining,Bin,Computer science,Visual analytics,Artificial intelligence,Cluster analysis,Data point,Computer vision,Bivariate data,Pattern recognition,Visualization,Statistical graphics,Scatter plot	Conference
Volume	ISSN	Citations
7530	0277-786X	3
PageRank	References	Authors
0.44	4	5

Authors (5 rows)

Cited by (3 rows)

References (4 rows)

Name	Order	Citations	PageRank
Ming C. Hao	1	3	0.44
Umeshwar Dayal	2	8452	2538.92
Ratnesh K. Sharma	3	483	53.37
Daniel A. Keim	4	7704	1141.60
Halldor Janetzko	5	312	20.69

1