Title
A preliminary study on automatic identification of patient smoking status in unstructured electronic health records
Abstract
Identifying smoking status of patients is vital for assessing their risk for a disease. With the rapid adoption of electronic health records (EHRs), patient information is scattered across various systems in the form of structured and unstructured data. In this study, we aimed to develop a hybrid system using rule-based, unsupervised and supervised machine learning techniques to automatically identify the smoking status of patients in unstructured EHRs. In addition to traditional features, we used per-document topic model distribution weights as features in our system. We also discuss the performance of our hybrid system using different feature sets. Our preliminary results demonstrated that combining per-document topic model distribution weights with traditional features improve the overall performance of the system.
Year
DOI
Venue
2015
10.18653/v1/W15-3818
BioNLP@IJCNLP
Field
DocType
Citations 
Data mining,Unstructured data,Artificial intelligence,Topic model,Hybrid system,Medicine,Machine learning
Conference
2
PageRank 
References 
Authors
0.41
5
4
Name
Order
Citations
PageRank
Jitendra Jonnagaddala14610.28
Hong-Jie Dai228821.58
pradeep ray330.80
Siaw-Teng Liaw45713.79