Title
Realistic Synthetic Data Generation - The ATEN Framework.
Abstract
Getting access to real medical data for research is notoriously difficult. Even when data exist they are usually incomplete and subject to restrictions due to confidentiality and privacy. Synthetic data (SD) are best replacements for real data but must be verifiably realistic. There is little or no investigation into systematically achieving realism in SD. This work investigates this problem, and contributes the ATEN framework, which incorporates three component approaches: (1) THOTH for synthetic data generation (SDG): (2) RA for characterising realism is SD, and (3) HORUS for validating realism in SD. The framework is found promising after its use in generating the realistic synthetic EHR (RS-EHR) for labour and birth. This framework is significant in guaranteeing realism in SDG projects. Future efforts focus on further validation of ATEN in a controlled multi-stream SDG process.
Year
DOI
Venue
2018
10.1007/978-3-030-29196-9_25
BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, BIOSTEC 2018
Keywords
DocType
Volume
Synthetic data generation,Knowledge discovery
Conference
1024
ISSN
Citations 
PageRank 
1865-0929
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Scott McLachlan101.35
Kudakwashe Dube25011.09
Thomas Gallagher300.34
Jennifer A. Simmonds400.34
Norman E. Fenton52331235.25