A data recipient centered de-identification method to retain statistical attributes

dc.contributor.authorGal, Tamas S.
dc.contributor.authorTucker, Thomas C.
dc.contributor.authorGangopadhyay, Aryya
dc.contributor.authorChen, Zhiyuan
dc.date.accessioned2021-08-16T17:43:30Z
dc.date.available2021-08-16T17:43:30Z
dc.date.issued2014-01-10
dc.description.abstractPrivacy has always been a great concern of patients and medical service providers. As a result of the recent advances in information technology and the government’s push for the use of Electronic Health Record (EHR) systems, a large amount of medical data is collected and stored electronically. This data needs to be made available for analysis but at the same time patient privacy has to be protected through de-identification. Although biomedical researchers often describe their research plans when they request anonymized data, most existing anonymization methods do not use this information when de-identifying the data. As a result, the anonymized data may not be useful for the planned research project. This paper proposes a data recipient centered approach to tailor the de-identification method based on input from the recipient of the data. We demonstrate our approach through an anonymization project for biomedical researchers with specific goals to improve the utility of the anonymized data for statistical models used for their research project. The selected algorithm improves a privacy protection method called Condensation by Aggarwal et al. Our methods were tested and validated on real cancer surveillance data provided by the Kentucky Cancer Registry.en_US
dc.description.urihttps://www.sciencedirect.com/science/article/pii/S1532046414000021?via%3Dihub#!en_US
dc.format.extent14 pagesen_US
dc.genrejournal articlesen_US
dc.identifierdoi:10.13016/m26v2u-oev7
dc.identifier.citationGal, Tamas S. et al.; A data recipient centered de-identification method to retain statistical attributes; Journal of Biomedical Informatics, Volume 50, Pages 32-45, 10 January 2014; https://doi.org/10.1016/j.jbi.2014.01.001en_US
dc.identifier.urihttps://doi.org/10.1016/j.jbi.2014.01.001
dc.identifier.urihttp://hdl.handle.net/11603/22455
dc.language.isoen_USen_US
dc.publisherElsevieren_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en_US
dc.subjectanonymizing medical recordsen_US
dc.subjectimproving the utility of the anonymized dataen_US
dc.titleA data recipient centered de-identification method to retain statistical attributesen_US
dc.typeTexten_US
dcterms.creatorhttps://orcid.org/0000-0002-6984-7248

Files

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: