Title
Visreduce: Fast And Responsive Incremental Information Visualization Of Large Datasets
Abstract
Performance and responsiveness of visual analytics sytems for exploratory data analysis of large datasets has been a long standing problem. We propose a method for incrementally computing visualizations in a distributed fashion by combining a modified MapReduce-style algorithm with a compressed columnar data store, resulting in significant improvements in performance and responsiveness for constructing commonly encountered information visualizations, e.g. bar charts, scatterplots, heat maps, cartograms and parallel coordinate plots. We compare our method with one that queries three other readily available database and data warehouse systems - PostgreSQL, Cloudera Impala and the MapReduce-based Apache Hive - in order to build visualizations. We show that our end-to-end approach allows for greater speed and guaranteed end-user responsiveness, even in the face of large, long-running queries.
Year
Venue
Keywords
2013
2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA
incremental visualization, online aggregation, information visualization, MapReduce, columnar storage
Field
DocType
ISSN
SQL,Data warehouse,Data mining,Data visualization,Bar chart,Information visualization,Computer science,Visual analytics,Artificial intelligence,Online aggregation,Exploratory data analysis,Machine learning
Conference
2639-1589
Citations 
PageRank 
References 
10
0.49
19
Authors
3
Name
Order
Citations
PageRank
Jean-Francois Im1331.47
Felix Giguere Villegas2100.49
Michael J. McGuffin398954.52