Neighbor-Sensitive Hashing. - Citegraph

Paper Info

Title
Neighbor-Sensitive Hashing.

Abstract
Approximate kNN (k-nearest neighbor) techniques using binary hash functions are among the most commonly used approaches for overcoming the prohibitive cost of performing exact kNN queries. However, the success of these techniques largely depends on their hash functions' ability to distinguish kNN items; that is, the kNN items retrieved based on data items' hashcodes, should include as many true kNN items as possible. A widely-adopted principle for this process is to ensure that similar items are assigned to the same hashcode so that the items with the hashcodes similar to a query's hashcode are likely to be true neighbors. In this work, we abandon this heavily-utilized principle and pursue the opposite direction for generating more effective hash functions for kNN tasks. That is, we aim to increase the distance between similar items in the hashcode space, instead of reducing it. Our contribution begins by providing theoretical analysis on why this revolutionary and seemingly counter-intuitive approach leads to a more accurate identification of kNN items. Our analysis is followed by a proposal for a hashing algorithm that embeds this novel principle. Our empirical studies confirm that a hashing algorithm based on this counter-intuitive idea significantly improves the efficiency and accuracy of state-of-the-art techniques.

Year	DOI	Venue
2015	10.14778/2850583.2850589	PVLDB
DocType	Volume	Issue
Journal	9	3
Citations	PageRank	References
9	0.47	35
Authors
3

Authors (3 rows)

Cited by (9 rows)

References (35 rows)

Name	Order	Citations	PageRank
Yongjoo Park	1	99	5.93
Michael J. Cafarella	2	2246	144.15
Barzan Mozafari	3	819	38.21

1