Title
Proseco: Visual Analysis Of Class Separation Measures And Dataset Characteristics
Abstract
Class separation is an important concept in machine learning and visual analytics. We address the visual analysis of class separation measures for both high-dimensional data and its corresponding projections into 2D through dimensionality reduction (DR) methods. Although a plethora of separation measures have been proposed, it is difficult to compare class separation between multiple datasets with different characteristics, multiple separation measures, and multiple DR methods. We present ProSeCo, an interactive visualization approach to support comparison between up to 20 class separation measures and up to 4 DR methods, with respect to any of 7 dataset characteristics: dataset size, dataset dimensions, class counts, class size variability, class size skewness, outlieriness, and real-world vs. synthetically generated data. ProSeCo supports (1) comparing across measures, (2) comparing high-dimensional to dimensionallyreduced 2D data across measures, (3) comparing between different DR methods across measures, (4) partitioning with respect to a dataset characteristic, (5) comparing partitions for a selected characteristic across measures, and (6) inspecting individual datasets in detail. We demonstrate the utility of ProSeCo in two usage scenarios, using datasets [1] posted at https://osf.io/epcf9/ .(c) 2021 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
Year
DOI
Venue
2021
10.1016/j.cag.2021.03.004
COMPUTERS & GRAPHICS-UK
Keywords
DocType
Volume
Computers and Graphics, Formatting, Guidelines
Journal
96
ISSN
Citations 
PageRank 
0097-8493
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Jürgen Bernard133432.32
Marco Hutter200.34
Matthias Zeppelzauer302.37
Michael Sedlmair491551.74
Tamara Munzner52562147.34