Abstract | ||
---|---|---|
Locality Sensitive Hashing (LSH) is widely used for efficient retrieval of candidate matches in very large audio, video, and image systems. However, extremely large reference databases necessitate a guaranteed limit on the memory used by the table lookup itself, no matter how the entries crowd different parts of the signature space, a guarantee that LSH does not give. In this paper, we provide such guaranteed limits, primarily through the design of the LSH bands. When combined with data-adaptive bin splitting (needed on only 0.04% of the occupied bins) this approach provides the required guarantee on memory usage. At the same time, it avoids the reduced recall that more extensive use of bin splitting would give. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1109/ICASSP.2009.4959971 | ICASSP |
Keywords | Field | DocType |
locality sensitive hashing,bin splitting,fingerprint identification,lsh band,required guarantee,memory usage,large audio,index terms— multimedia databases,large-scale retrieval,lsh banding,information retrieval,guaranteed limit,large reference databases,occupied bin,data-adaptive bin splitting,pattern matching,fingerprint recognition,data mining,solids,image retrieval,probability density function,memory management,indexing terms,entropy,mutual information | Locality-sensitive hashing,Data mining,Pattern recognition,Bin,Computer science,Fingerprint,Memory management,Artificial intelligence,Recall,Probability density function,Pattern matching | Conference |
ISSN | Citations | PageRank |
1520-6149 | 8 | 0.49 |
References | Authors | |
8 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Michele Covell | 1 | 706 | 78.42 |
Shumeet Baluja | 2 | 4053 | 728.83 |