Title
A reliable order-statistics-based approximate nearest neighbor search algorithm
Abstract
We propose a new algorithm for fast approximate nearest neighbor search based on the properties of ordered vectors. Data vectors are classified based on the index and sign of their largest components, thereby partitioning the space in a number of cones centered in the origin. The query is itself classified, and the search starts from the selected cone and proceeds to neighboring ones. Overall, the proposed algorithm corresponds to locality sensitive hashing in the space of directions, with hashing based on the order of components. Thanks to the statistical features emerging through ordering, it deals very well with the challenging case of unstructured data, and is a valuable building block for more complex techniques dealing with structured data. Experiments on both simulated and real-world data prove the proposed algorithm to provide a state-of-the-art performance.
Year
DOI
Venue
2015
10.1109/TIP.2016.2624141
IEEE Trans. Image Processing
Keywords
Field
DocType
Quantization (signal),Approximation algorithms,Partitioning algorithms,Dictionaries,Indexes,Sorting,Reliability
Best bin first,Computer science,Unstructured data,Theoretical computer science,Artificial intelligence,Order statistic,Nearest neighbor search,Locality-sensitive hashing,Pattern recognition,Algorithm,Hash function,Data model,Machine learning
Journal
Volume
Issue
ISSN
abs/1509.03453
1
1057-7149
Citations 
PageRank 
References 
1
0.35
24
Authors
3
Name
Order
Citations
PageRank
Luisa Verdoliva197157.12
Davide Cozzolino235819.37
Giovanni Poggi365553.64