Title | ||
---|---|---|
A preliminary study on automatic identification of patient smoking status in unstructured electronic health records |
Abstract | ||
---|---|---|
Identifying smoking status of patients is vital for assessing their risk for a disease. With the rapid adoption of electronic health records (EHRs), patient information is scattered across various systems in the form of structured and unstructured data. In this study, we aimed to develop a hybrid system using rule-based, unsupervised and supervised machine learning techniques to automatically identify the smoking status of patients in unstructured EHRs. In addition to traditional features, we used per-document topic model distribution weights as features in our system. We also discuss the performance of our hybrid system using different feature sets. Our preliminary results demonstrated that combining per-document topic model distribution weights with traditional features improve the overall performance of the system. |
Year | DOI | Venue |
---|---|---|
2015 | 10.18653/v1/W15-3818 | BioNLP@IJCNLP |
Field | DocType | Citations |
Data mining,Unstructured data,Artificial intelligence,Topic model,Hybrid system,Medicine,Machine learning | Conference | 2 |
PageRank | References | Authors |
0.41 | 5 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jitendra Jonnagaddala | 1 | 46 | 10.28 |
Hong-Jie Dai | 2 | 288 | 21.58 |
pradeep ray | 3 | 3 | 0.80 |
Siaw-Teng Liaw | 4 | 57 | 13.79 |