Abstract | ||
---|---|---|
In this paper, we propose data space mapping techniques for storage and retrieval in multi-dimensional databases on multi-disk architectures. We identify the important factors for an efficient multi-disk searching of multi-dimensional data and develop secondary storage organization and retrieval techniques that directly address these factors. We especially focus on high dimensional data, where none of the current approaches are effective. In contrast to the current declustering techniques, storage techniques in this paper consider both inter- and intra-disk organization of the data. The data space is first partitioned into buckets, then the buckets are declustered to multiple disks while they are clustered in each disk. The queries are executed through bucket identification techniques that locate the pages. One of the partitioning techniques we discuss is especially practical for high dimensional data, and our disk and page allocation techniques are optimal with respect to number of I/O accesses and seek times. We provide experimental results that support our claims on two real high dimensional datasets. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1016/j.is.2005.06.001 | Inf. Syst. |
Keywords | Field | DocType |
large multi-dimensional databases,real high dimensional datasets,space partitioning,disk and page allocation,efficient multi-disk,multi-dimensional data,current approach,current declustering technique,data space mapping technique,multi-disk architectures,performance,data space,data space mapping,storage technique,parallel i/o,high dimensional data,secondary storage organization,storage,space mapping | Space partitioning,Data mining,Multi dimensional,Clustering high-dimensional data,Data space,Computer science,Input/output,Parallel I/O,Database,Auxiliary memory | Journal |
Volume | Issue | ISSN |
32 | 1 | Information Systems |
Citations | PageRank | References |
0 | 0.34 | 35 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hakan Ferhatosmanoglu | 1 | 1352 | 89.79 |
Aravind Ramachandran | 2 | 26 | 1.44 |
Divyakant Agrawal | 3 | 8201 | 1674.75 |
Amr El Abbadi | 4 | 6767 | 1569.95 |