Abstract | ||
---|---|---|
Sanitization of a document involves removing sensitive information from the document, so that it may be distributed to a broader audience. Such sanitization is needed while declassifying documents involving sensitive or confidential information such as corporate emails, intelligence reports, medical records, etc. In this paper, we present the ERASE framework for performing document sanitization in an automated manner. ERASE can be used to sanitize a document dynamically, so that different users get different views of the same document based on what they are authorized to know. We formalize the problem and present algorithms used in ERASE for finding the appropriate terms to remove from the document. Our preliminary experimental study demonstrates the efficiency and efficacy of the proposed algorithms. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1145/1458082.1458194 | International Conference on Information and Knowledge Management |
Keywords | Field | DocType |
different view,erase framework,automated manner,efficient technique,appropriate term,sensitive information,different user,present algorithm,confidential information,document sanitization,document dynamically,anonymization,redaction,sanitization,medical records | Data mining,World Wide Web,Confidentiality,Information retrieval,Redaction,Computer science,Information sensitivity | Conference |
Citations | PageRank | References |
20 | 0.89 | 9 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Venkatesan T. Chakaravarthy | 1 | 421 | 34.76 |
Himanshu Gupta | 2 | 2653 | 277.86 |
Prasan Roy | 3 | 1027 | 78.86 |
Mukesh K. Mohania | 4 | 593 | 169.17 |