Title
Gravitational fixed radius nearest neighbor for imbalanced problem
Abstract
We introduce a gravitational scenario into the fixed radius nearest neighbor rule. The proposed GFRNN deals with the imbalanced classification problem. GFRNN does not need any manual parameter setting or coordination. Comparison experiments on 40 datasets validate its effectiveness and efficiency.

This paper proposes a novel learning model that introduces the calculation of pairwise gravitation between selected patterns into the classical fixed radius nearest neighbor method, in order to overcome the drawback of the original nearest neighbor rule when dealing with imbalanced data. The traditional k nearest neighbor rule is considered to lose power on imbalanced datasets because the final decision might be dominated by patterns from the negative classes regardless of the distance measurements. Unlike existing modified nearest neighbor learning models, the proposed method, named GFRNN, has a simple structure and is therefore easy to use. Moreover, none of GFRNN's parameters needs initializing or coordinating during the learning procedure. In practice, GFRNN first selects candidate patterns from the training set under the fixed radius nearest neighbor rule, and then introduces a metric based on a modified law of gravitation to measure the distance between the query pattern and each candidate. Finally, GFRNN makes its decision based on the sum of the gravitational forces exerted by the candidates on the query pattern. An experimental comparison with nine typical methods on forty imbalanced datasets validates both the effectiveness and the efficiency of GFRNN. In conclusion, the contribution of this paper is a new, simple nearest neighbor architecture that deals with imbalanced classification effectively without any manual parameter coordination, further expanding the family of nearest neighbor based rules.
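The three steps named in the abstract (fixed-radius candidate selection, a gravitation-based measure, and a decision by summed force) can be illustrated with a minimal sketch. The abstract does not give the formulas, so several things here are assumptions rather than the authors' method: the function name `gfrnn_predict`, an explicitly supplied `radius` (the paper states that GFRNN needs no manual parameter setting, so it presumably derives this value itself), the force form `mass / distance**2`, the optional per-class `masses`, and the fallback to the single nearest pattern when no candidate lies within the radius.

```python
import numpy as np


def gfrnn_predict(X_train, y_train, query, radius, masses=None, eps=1e-12):
    """Illustrative gravitational fixed-radius nearest neighbor decision.

    Hypothetical sketch: `radius`, `masses`, and the m / d^2 force form are
    assumptions, not details taken from the paper.
    """
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train)
    query = np.asarray(query, dtype=float)

    # Step 1: select candidates whose Euclidean distance to the query
    # is within the fixed radius.
    dists = np.linalg.norm(X_train - query, axis=1)
    in_ball = dists <= radius
    if not in_ball.any():
        # Assumption: if the ball is empty, fall back to the nearest pattern(s).
        in_ball = dists == dists.min()

    classes = np.unique(y_train)
    if masses is None:
        # Assumption: every class gets unit mass unless the caller says otherwise.
        masses = {c: 1.0 for c in classes}

    # Step 2: gravitational pull of each candidate on the query
    # (assumed form: candidate mass over squared distance).
    cand_labels = y_train[in_ball]
    cand_mass = np.array([masses[c] for c in cand_labels], dtype=float)
    forces = cand_mass / (dists[in_ball] ** 2 + eps)

    # Step 3: sum the forces per class and return the class with the largest total.
    totals = {c: forces[cand_labels == c].sum() for c in classes}
    return max(totals, key=totals.get)


if __name__ == "__main__":
    X = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [1.0, 1.0]]
    y = [0, 0, 0, 1]  # class 1 is the minority class
    print(gfrnn_predict(X, y, [0.5, 0.5], radius=1.0))
    print(gfrnn_predict(X, y, [0.5, 0.5], radius=1.0, masses={0: 1.0, 1: 5.0}))
```

Giving the minority class a larger mass is one plausible way for a summed-force rule to counteract majority-class dominance; whether GFRNN assigns masses this way is not stated in the abstract.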
Year
2015
DOI
10.1016/j.knosys.2015.09.015
Venue
Knowledge-Based Systems
Keywords
Fixed radius search, Nearest neighbor rule, Imbalanced data, Pattern classification
Field
Data mining, R-tree, Fixed-radius near neighbors, Best bin first, Computer science, Ball tree, Nearest neighbor graph, Artificial intelligence, Cover tree, Large margin nearest neighbor, Machine learning, Nearest neighbor search
DocType
Journal
Volume
90
Issue
C
ISSN
0950-7051
Citations
12
PageRank
0.56
References
31
Authors
3
Name          Order   Citations   PageRank
Yujin Zhu     1       36          5.28
Zhe Wang      2       268         18.89
Daqi Gao      3       110         16.30