Title
Tutorial: Are You My Neighbor?: Bringing Order to Neighbor Computing Problems.
Abstract
Finding nearest neighbors is an important topic that has attracted much attention over the years and has applications in many fields, such as market basket analysis, plagiarism and anomaly detection, community detection, ligand-based virtual screening, etc. As data are easier and easier to collect, finding neighbors has become a potential bottleneck in analysis pipelines. Performing pairwise comparisons given the massive datasets of today is no longer feasible. The high computational complexity of the task has led researchers to develop approximate methods, which find many but not all of the nearest neighbors. Yet, for some types of data, efficient exact solutions have been found by carefully partitioning or filtering the search space in a way that avoids most unnecessary comparisons. In recent years, there have been several fundamental advances in our ability to efficiently identify appropriate neighbors, especially in non-traditional data, such as graphs or document collections. In this tutorial, we provide an in-depth overview of recent methods for finding (nearest) neighbors, focusing on the intuition behind choices made in the design of those algorithms and on the utility of the methods in real-world applications. Our tutorial aims to provide a unifying view of "neighbor computing" problems, spanning from numerical data to graph data, from categorical data to sequential data, and related application scenarios. For each type of data, we will review the current state-of-the-art approaches used to identify neighbors and discuss how neighbor search methods are used to solve important problems.
Year
DOI
Keywords
2019
10.1145/3292500.3332292
graph construction, nearest neighbor search, sparse data
Field
DocType
ISSN
Anomaly detection,Bottleneck,Pairwise comparison,Categorical variable,Computer science,Data type,Artificial intelligence,Affinity analysis,Nearest neighbor search,Machine learning,Computational complexity theory
Conference
978-1-4503-6201-6
ISBN
Citations 
PageRank 
978-1-4503-6201-6
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
David C. Anastasiu15911.17
Huzefa Rangwala243557.50
Andrea Tagarelli347552.29