Title
An Extensible De-Identification Framework for Privacy Protection of Unstructured Health Information: Creating Sustainable Privacy Infrastructures.
Abstract
The volume of unstructured health records has increased exponentially across healthcare settings. Similarly, the number of healthcare providers that wish to exchange records has also increased and, as a result, de-identification and the preservation of privacy features have become increasingly important and necessary. Governance guidelines now require sensitive information to be masked or removed yet this remains a difficult and often ad-hoc task, particularly when dealing with unstructured text. Annotators are typically used to identify such sensitive information but they may only be effective in certain text fragments. There is at present no hybrid, sustainable framework that aggregates different annotators together. This paper proposes a novel framework that leverages a combination of state-of-the-art annotators in order to maximize the effectiveness of the de-identification of health information.
Year
DOI
Venue
2019
10.3233/SHTI190404
Studies in Health Technology and Informatics
Keywords
Field
DocType
Privacy,De-Identification,Natural Language Processing
Data science,De-identification,Knowledge management,Medicine,Health information
Conference
Volume
ISSN
Citations 
264
0926-9630
0
PageRank 
References 
Authors
0.34
0
4
Name
Order
Citations
PageRank
Stefano Braghin182.56
Joao H. Bettencourt-Silva231.74
Killian Levacher300.34
Spiros Antonatos432118.61