Title
Strategies for handling missing clinical data for automated surgical site infection detection from the electronic health record.
Abstract
Display Omitted For each patients many data elements in the EHR are missing (e.g. test not necessary).We compared commonly used imputation methods for the problem of SSI detection.Imputation offered superior performance over complete-case analysis.Some very simple techniques offered excellent performance. Proper handling of missing data is important for many secondary uses of electronic health record (EHR) data. Data imputation methods can be used to handle missing data, but their use for analyzing EHR data is limited and specific efficacy for postoperative complication detection is unclear. Several data imputation methods were used to develop data models for automated detection of three types (i.e., superficial, deep, and organ space) of surgical site infection (SSI) and overall SSI using American College of Surgeons National Surgical Quality Improvement Project (NSQIP) Registry 30-day SSI occurrence data as a reference standard. Overall, models with missing data imputation almost always outperformed reference models without imputation that included only cases with complete data for detection of SSI overall achieving very good average area under the curve values. Missing data imputation appears to be an effective means for improving postoperative SSI detection using EHR clinical data.
Year
DOI
Venue
2017
10.1016/j.jbi.2017.03.009
Journal of Biomedical Informatics
Keywords
Field
DocType
Electronic health records,Missing data,Surgical site infections
Data mining,Data collection,Data modeling,Computer science,Medical record,Missing data,Imputation (statistics),Reference standards,Quality management,Missing data imputation
Journal
Volume
Issue
ISSN
68
C
1532-0464
Citations 
PageRank 
References 
3
0.43
5
Authors
6
Name
Order
Citations
PageRank
Zhen Hu1102.77
G B Melton226445.72
Elliot G. Arsoniadis382.89
Yan Wang42510.08
Mary R. Kwaan572.19
Gyorgy J. Simon66315.40