Title
Timeline and episode-structured clinical data: Pre-processing for Data Mining and analytics
Abstract
Data Mining has been used in the healthcare domain for diagnosis and treatment analysis, resource management and fraud detection. It brings a set of tools and techniques that can be applied to large-scale patient data to discover underlying patterns and provide healthcare professionals an additional source of knowledge for making decisions. The Southampton Breast Cancer Data System (SBCDS) containing some 16,000 timeline-structured records is a visually rich and highly intuitive system for the manual and automated transfer of demographic, pathology and treatment data into an episode-based structure. While expansion of the data mining capability in SBCDS is one of the objectives of our research, real-world patient data is generally incomplete, inconsistent and containing errors. This case study will focus on the data pre-processing stage in order to clean the raw data and prepare the final dataset for use in data mining and analytics. Some initial results are given for sequential patterns mining and classification which highlight the advantages of the approach.
Year
DOI
Venue
2016
10.1109/ICDEW.2016.7495618
2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)
Keywords
Field
DocType
Health informatics,electronic patient records,breast cancer data,pre-processing,data mining
Data science,Health care,Resource management,Data mining,Computer science,Raw data,Data pre-processing,Timeline,Analytics,Health informatics
Conference
Citations 
PageRank 
References 
2
0.38
5
Authors
4
Name
Order
Citations
PageRank
Jing Lu1134.01
Alan Hales220.38
David Rew330.79
Malcolm Keech4385.31