Title
Hiding information by cell suppression.
Abstract
Joining relational data can jeopardize patient confidentiality if disseminated data for research can be joined with publicly available data containing, for example, explicit identifiers. Ambiguity in data hinders the construction of primary keys that are of importance when joining data tables. We define two values to be indiscernible if they are the same or at least one of them is a special value suppressed. Two rows in a data table are indiscernible if their corresponding entries are indiscernible. We further define a table to be k-ambiguous if each row is indiscernible from at least k rows in the same table. We present two simple heuristics to make a table k-ambiguous by cell suppression, and compare them on example data.
Year
Venue
Field
2001
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION
Row,Data mining,Relational database,Identifier,Confidentiality,Information retrieval,Computer science,Patient confidentiality,Heuristics,Ambiguity,Table (information)
DocType
Issue
ISSN
Conference
SUPnan
1067-5027
Citations 
PageRank 
References 
3
0.73
0
Authors
3
Name
Order
Citations
PageRank
Staal Vinterbo136132.66
Lucila Ohno-Machado21426187.95
Stephan Dreiseitl333834.80