Title
ivhd: A Robust Linear-Time and Memory Efficient Method for Visual Exploratory Data Analysis.
Abstract
Data embedding (DE) and graph visualization (GV) methods are very compatible tools used in Exploratory Data Analysis for visualization of complex data such as high-dimensional data and complex networks. However, high computational complexity and memory load of existing DE and GV algorithms, considerably hinders visualization of truly large and big data consisting of as many as M~106+ data objects and N~103+ dimensions. Recently, we have shown that by employing only a small fraction of distances between data objects one can obtain very satisfactory reconstruction of topology of a complex data in 2D in a linear-time O(M). In this paper, we demonstrate the high robustness of our approach. We show that even poor approximations of the nn-nearst neighbor graph, representing high-dimensional data, can yield acceptable data embeddings. Furthermore, some incorrectness in the nearest neighbor list can often be useful to improve the quality of data visualization. This robustness of our DE method, together with its high memory and time efficiency, meets perfectly the requirements of big and distributed data visualization, when finding the accurate nearest neighbor list represents a great computational challenge.
Year
Venue
Field
2017
MLDM
Graph drawing,Data visualization,Pattern recognition,Visualization,Computer science,Complex data type,Robustness (computer science),Artificial intelligence,Time complexity,Exploratory data analysis,Big data,Machine learning
DocType
Citations 
PageRank 
Conference
1
0.36
References 
Authors
8
2
Name
Order
Citations
PageRank
Witold Dzwinel113225.14
Rafal Wcisło2297.63