Title | ||
---|---|---|
ivhd: A Robust Linear-Time and Memory Efficient Method for Visual Exploratory Data Analysis. |
Abstract | ||
---|---|---|
Data embedding (DE) and graph visualization (GV) methods are very compatible tools used in Exploratory Data Analysis for visualization of complex data such as high-dimensional data and complex networks. However, high computational complexity and memory load of existing DE and GV algorithms, considerably hinders visualization of truly large and big data consisting of as many as M~106+ data objects and N~103+ dimensions. Recently, we have shown that by employing only a small fraction of distances between data objects one can obtain very satisfactory reconstruction of topology of a complex data in 2D in a linear-time O(M). In this paper, we demonstrate the high robustness of our approach. We show that even poor approximations of the nn-nearst neighbor graph, representing high-dimensional data, can yield acceptable data embeddings. Furthermore, some incorrectness in the nearest neighbor list can often be useful to improve the quality of data visualization. This robustness of our DE method, together with its high memory and time efficiency, meets perfectly the requirements of big and distributed data visualization, when finding the accurate nearest neighbor list represents a great computational challenge. |
Year | Venue | Field |
---|---|---|
2017 | MLDM | Graph drawing,Data visualization,Pattern recognition,Visualization,Computer science,Complex data type,Robustness (computer science),Artificial intelligence,Time complexity,Exploratory data analysis,Big data,Machine learning |
DocType | Citations | PageRank |
Conference | 1 | 0.36 |
References | Authors | |
8 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Witold Dzwinel | 1 | 132 | 25.14 |
Rafal Wcisło | 2 | 29 | 7.63 |