Title
K nearest neighbours with mutual information for simultaneous classification and missing data imputation
Abstract
Missing data is a common drawback in many real-life pattern classification scenarios. One of the most popular solutions is missing data imputation by the K nearest neighbours (KNN) algorithm. In this article, we propose a novel KNN imputation procedure using a feature-weighted distance metric based on mutual information (MI). This method provides a missing data estimation aimed at solving the classification task, i.e., it provides an imputed dataset which is directed toward improving the classification performance. The MI-based distance metric is also used to implement an effective KNN classifier. Experimental results on both artificial and real classification datasets are provided to illustrate the efficiency and the robustness of the proposed algorithm.
Year
DOI
Venue
2009
10.1016/j.neucom.2008.11.026
Neurocomputing
Keywords
Field
DocType
effective knn classifier,novel knn imputation procedure,mutual information,missing data pattern classification imputation k nearest neighbours mutual information,imputation,missing data estimation,real classification datasets,pattern classification,k nearest neighbours,missing data,classification task,simultaneous classification,missing data imputation,real-life pattern classification scenario,mi-based distance metric,classification performance,distance metric
Data mining,Pattern recognition,Metric (mathematics),Robustness (computer science),Artificial intelligence,Mutual information,Missing data,Imputation (statistics),Classifier (linguistics),Machine learning,Mathematics,Missing data imputation
Journal
Volume
Issue
ISSN
72
7-9
Neurocomputing
Citations 
PageRank 
References 
48
1.41
23
Authors
4
Name
Order
Citations
PageRank
Pedro J. García-Laencina127514.14
José-Luis Sancho-Gómez218217.26
Aníbal R. Figueiras-Vidal346738.03
Michel Verleysen42291221.75