Abstract | ||
---|---|---|
The density-based spatial clustering of applications with noise (DBSCAN) algorithm has been well studied in database domains for clustering multi-dimensional data to extract arbitrary shape clusters. Recently, with the growing interest in big data and increasing diversification of data, the typical size and volume of databases have increased and data have increasingly become high-dimensional. Therefore, a large number of speed-up techniques for DBSCAN algorithms including exact and approximate approaches have been proposed. The fastest DBSCAN algorithm is the cell-based algorithm, which divides the whole data set into small cells. In this paper, we propose a novel exact version cell-based DBSCAN algorithm using minimum bounding rectangle (MBR) criteria. The connecting cells step is the most time-consuming step of the cell-based algorithm. The proposed algorithm can process the connecting cells step at high speed by using MBR criteria. We implemented the proposed cell-based DBSCAN algorithm and show that it outperforms the conventional one in high dimensions. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1007/978-3-319-55705-2_10 | DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017) |
Keywords | Field | DocType |
DBSCAN, Density-based clustering, Cell-based DBSCAN algorithm, Minimum bounding rectangle | Minimum bounding rectangle,Minimum bounding box algorithms,Pattern recognition,Computer science,Artificial intelligence,Cluster analysis,Big data,DBSCAN,Minimum bounding box | Conference |
Volume | ISSN | Citations |
10179 | 0302-9743 | 4 |
PageRank | References | Authors |
0.38 | 5 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tatsuhiro Sakai | 1 | 10 | 4.71 |
Keiichi Tamura | 2 | 37 | 13.86 |
H. Kitakami | 3 | 94 | 49.68 |