Title
DMP_MI: An Effective Diabetes Mellitus Classification Algorithm on Imbalanced Data With Missing Values.
Abstract
As a widely known chronic disease, diabetes mellitus is called a silent killer. It makes the body produce less insulin and causes increased blood sugar, which leads to many complications and affects the normal functioning of various organs, such as eyes, kidneys, and nerves. Although diabetes has attracted high attention in research, due to the existence of missing values and class imbalance in the data, the overall performance of diabetes classification using machine learning is relatively low. In this paper, we propose an effective Prediction algorithm for Diabetes Mellitus classification on Imbalanced data with Missing values (DMP_MI). First, the missing values are compensated by the Naive Bayes (NB) method for data normalization. Then, an adaptive synthetic sampling method (ADASYN) is adopted to reduce the influence of class imbalance on the prediction performance. Finally, a random forest (RF) classifier is used to generate predictions and evaluated using comprehensive set of evaluation indicators. Experiments performed on Pima Indians diabetes dataset from the University of California at Irvine, Irvine (UCI) Repository, have demonstrated the effectiveness and superiority of our proposed DMP_MI.
Year
DOI
Venue
2019
10.1109/ACCESS.2019.2929866
IEEE ACCESS
Keywords
DocType
Volume
Diabetes mellitus prediction,machine learning,adaptive synthetic sampling
Journal
7
ISSN
Citations 
PageRank 
2169-3536
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
Qian Wang164.86
Weijia Cao200.34
Jiawei Guo300.68
Jiadong Ren43912.15
Yongqiang Cheng513329.99
Darryl N. Davis65512.83