Title
HyPerInsight: Data Exploration Deep Inside HyPer.
Abstract
Nowadays we are drowning in data of various varieties. For all these mixed types and categories of data there exist even more different analysis approaches, often done in single hand-written solutions. We propose to extend HyPer, a main memory database system to a uniform data agent platform following the one system fits all approach for solving a wide variety of data analysis problems. We achieve this by applying a flexible operator concept to a set of various important data exploration algorithms. With that, HyPer solves analytical questions using clustering, classification, association rule mining and graph mining besides standard HTAP (Hybrid Transaction and Analytical Processing) workloads on the same database state. It enables to approach the full variety and volume of HTAP extended for data exploration (HTAPx), and only needs knowledge of already introduced SQL extensions that are automatically optimized by the database's standard optimizer. In this demo we will focus on the benefits and flexibility we create by using the SQL extensions for several well-known mining workloads. In our interactive webinterface for this project named HyPerInsight we demonstrate how HyPer outperforms the best open source competitor Apache Spark in common use cases in social media, geo-data, recommender systems and several other.
Year
DOI
Venue
2017
10.1145/3132847.3133167
CIKM
Keywords
Field
DocType
HyPer, Database operators, Query processing, SQL, Apriori, k-Means, DBscan
SQL,Recommender system,k-means clustering,Data mining,Spark (mathematics),Computer science,Association rule learning,Database transaction,Cluster analysis,DBSCAN
Conference
ISBN
Citations 
PageRank 
978-1-4503-4918-5
0
0.34
References 
Authors
9
6
Name
Order
Citations
PageRank
Nina Hubig1134.64
Linnea Passing2141.03
Maximilian E. Schüle300.34
Dimitri Vorona493.23
Alfons Kemper53519769.50
Thomas Neumann62523156.50