Abstract | ||
---|---|---|
Bloom filters are space efficient, probabilistic data structures for membership queries with low false positives and zero false negatives. They are widely used in the field of big data, networking, bio-informatics, cloud computing and IoT. However, a standard bloom filter does not support fuzzy membership queries. To overcome this limitation, we propose a novel algorithm using locality sensitive hashing on strings which extends the capability of a bloom filter to support fuzzy queries and calculate a score indicating its degree of similarity with the dataset. Experiments are performed to analyze false positive and false negative rates along with space requirements, time taken to serve a query and time taken to build the data structure. The proposed algorithm is benchmarked on two real world datasets where it outperforms all the baselines while maintaining all the advantages of a standard bloom filter. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/ICSC50631.2021.00010 | 2021 IEEE 15th International Conference on Semantic Computing (ICSC) |
Keywords | DocType | ISSN |
Bloom Filter,Fuzzy Bloom Filter,Locality Sensitive Hashing,Fuzzy Matching | Conference | 2325-6516 |
ISBN | Citations | PageRank |
978-1-7281-8900-0 | 0 | 0.34 |
References | Authors | |
0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Rishabh Kumar | 1 | 0 | 0.34 |
Hari Prasanna P | 2 | 0 | 0.34 |
Mukund Rungta | 3 | 0 | 1.01 |
Swetha Kashinath Phuleker | 4 | 0 | 0.34 |
Hemant Tiwari | 5 | 0 | 0.68 |
Vanraj Vala | 6 | 0 | 0.68 |