Title
Multi-Way Theta-Join Based On Cmd Storage Method
Abstract
In the era of the Big Data, how to analyze such a vast quantity of data is a challenging problem, and conducting a multi-way theta-join query is one of the most time consuming operations. MapReduce has been mentioned most in the massive data processing area and some join algorithms based on it have been raised in recent years. However, MapReduce paradigm itself may not be suitable to some scenarios and multi-way theta-join seems to be one of them. Many multi- way theta-join algorithms on traditional parallel database have been raised for many years, but no algorithm has been mentioned on the CMD (coordinate modulo distribution) storage method, although some algorithms on equal-join have been proposed. In this paper, we proposed a multi-way theta-join method based on CMD, which takes the advantage of the CMD storage method. Experiments suggest that it's a valid and efficient method which achieves significant improvement compared to those applied on the MapReduce.
Year
DOI
Venue
2014
10.1007/978-3-319-05810-8_5
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I
Keywords
Field
DocType
CMD, Multi-way Theta-Join
Data mining,Data processing,Computer science,Modulo,Parallel database,Big data
Conference
Volume
ISSN
Citations 
8421
0302-9743
0
PageRank 
References 
Authors
0.34
6
4
Name
Order
Citations
PageRank
Lei Li1223.51
Hong Gao21086120.07
Mingrui Zhu300.34
Zhaonian Zou433115.78