Title
CRDT: Correlation Ratio Based Decision Tree Model for Healthcare Data Mining
Abstract
The phenomenal growth in the healthcare data has inspired us in investigating robust and scalable models for data mining. For classification problems Information Gain(IG) based Decision Tree is one of the popular choices. However, depending upon the nature of the dataset, IG based Decision Tree may not always perform well as it prefers the attribute with more number of distinct values as the splitting attribute. Healthcare datasets generally have many attributes and each attribute generally has many distinct values. In this paper, we have tried to focus on this characteristics of the datasets while analysing the performance of our proposed approach which is a variant of Decision Tree model and uses the concept of Correlation Ratio(CR). Unlike IG based approach, this CR based approach has no biasness towards the attribute with more number of distinct values. We have applied our model on some benchmark healthcare datasets to show the effectiveness of the proposed technique.
Year
DOI
Venue
2015
10.1109/BIBE.2016.21
2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)
Keywords
Field
DocType
Data Mining,Healthcare,Decision Tree,Information Gain,Correlation Ratio
Health care,Data mining,Decision tree,Computer science,Decision tree model,Correlation ratio,Artificial intelligence,Information gain ratio,Machine learning,Decision tree learning,Scalability,Incremental decision tree
Journal
Volume
ISSN
ISBN
abs/1509.07266
2471-7819
978-1-5090-3835-0
Citations 
PageRank 
References 
2
0.39
7
Authors
3
Name
Order
Citations
PageRank
smita roy120.39
Samrat Mondal210018.02
Asif Ekbal3737119.31