Title
Dimensionality reduction using magnitude and shape approximations
Abstract
High dimensional data sets are encountered in many modern database applications. The usual approach is to construct a summary of the data set through a lossy compression technique, and use this lower dimensional synopsis to provide fast, approximate answers to the queries. In this paper, we develop a novel dimensionality reduction technique based on partitioning the high dimensional vector space into orthogonal subspaces. First, we find a relation between the Euclidian distance of two n-dimensional vectors and the Euclidian distances of their projections on the orthogonal subspaces. Then, based on this relation we develop a method to approximate the Euclidian distance using novel inner product approximation. This process allows us to incorporate the shape information of the vectors to this approximation. While the inner product approximation is symmetric, i.e., captures only the magnitude information of the data, the proposed method takes both the magnitude and shape information of the original vectors into account through partitioning. In the experiments, we demonstrate the effectiveness of our technique by comparing it with commonly used methods.
Year
DOI
Venue
2003
10.1145/956863.956883
CIKM
Keywords
Field
DocType
lossy compression technique,inner product approximation,high dimensional vector space,shape information,novel dimensionality reduction technique,orthogonal subspaces,lower dimensional synopsis,shape approximation,euclidian distance,dimensionality reduction,high dimensional data set,magnitude information,lossy compression,inner product,high dimensional data,similarity search,vector space
Data mining,Magnitude (mathematics),Vector space,Clustering high-dimensional data,Mathematical optimization,Dimensionality reduction,Lossy compression,Computer science,Euclidean distance,Algorithm,Linear subspace,Nearest neighbor search
Conference
ISBN
Citations 
PageRank 
1-58113-723-0
11
0.83
References 
Authors
27
2
Name
Order
Citations
PageRank
Ümit Y. Ogras120315.03
Hakan Ferhatosmanoglu2135289.79