Title
Dealing With Inliers In Feature Vector Data
Abstract
Inliers (bridge points) between clusters degrade the ability of many algorithms to find clusters in numerical data. We present three new approaches to the detection and removal of inliers. Two approaches are based on Local Outlier Factor (LOF) scores. We also discuss using LOF scores for an isolation Nearest Neighbour Ensemble (iNNE) approach to inlier detection. The third approach uses MaxiMin (MM) sampling to remove both inliers and outliers. We compare the three approaches on one synthetic and two real-life datasets. The failure of single linkage clustering due to the existence of bridging points is used as a means of evaluating the relative effectiveness of the three methods. We also show how inliers can degrade the quality of images built by the improved Visual Assessment of Tendency (iVAT) algorithm, which provides a visual representation of potential single linkage clusters in the data.
Year: 2018
DOI: 10.1142/S021848851840010x
Venue: INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS
Keywords: Cluster tendency, anomaly suppression for single linkage, chaining, LOF, iNNE, anomaly corrected iVAT
Field: Cluster (physics), Chaining, Feature vector, Artificial intelligence, Mathematics, Machine learning
DocType: Journal
Volume: 26
Issue: Supplement-2
ISSN: 0218-4885
Citations: 0
PageRank: 0.34
References: 8
Authors: 6
Name
Order
Citations
PageRank
Dheeraj Kumar1729.96
Zahra Ghafoori200.68
James Bezdek3916.97
Christopher Leckie42422155.20
Kotagiri Ramamohanarao54716993.87
M. Palaniswami64107290.84