Clustering Approaches for Anonymizing High-Dimensional Sequential Activity Data

dc.contributor.advisorChen, Zhiyuan
dc.contributor.advisorKoru, Gunes
dc.contributor.authorParameshwarappa, Pooja
dc.contributor.departmentInformation Systems
dc.contributor.programInformation Systems
dc.date.accessioned2021-09-01T13:55:50Z
dc.date.available2021-09-01T13:55:50Z
dc.date.issued2020-01-01
dc.description.abstractIn the current IoT era, collection of activity data such as physical and daily activity data has become ubiquitous. Publishing activity data can facilitate personal and population health management and promote reproducible health care research. However, publishing such data can also bring high privacy risks including re-identification of individuals in the data set. Therefore, there is a growing need for anonymizing the data before publishing. One of the challenges in anonymizing sequential data such as activity data is its high-dimensional nature. Although existing techniques work sufficiently for cross-sectional data, they result in low run-time performance when applied directly to sequential data. In this research, we propose Multi-level Clustering (MC) based anonymization approaches that apply k-anonymity, differential privacy, and l-diversity privacy models. The proposed MC step improves the performance of the anonymization approaches by reducing the clustering time drastically. Results show that the proposed approaches in addition to being more efficient than the existing approaches, also preserve the utility of the data as much as the existing approaches.
dc.formatapplication:pdf
dc.genredissertations
dc.identifierdoi:10.13016/m2bjy8-5juc
dc.identifier.other12170
dc.identifier.urihttp://hdl.handle.net/11603/22905
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.sourceOriginal File Name: Parameshwarappa_umbc_0434D_12170.pdf
dc.subjectActivity data
dc.subjectAnonymization
dc.subjectClustering
dc.subjectHigh-dimensional data
dc.subjectLongitudinal data
dc.subjectPrivacy
dc.titleClustering Approaches for Anonymizing High-Dimensional Sequential Activity Data
dc.typeText
dcterms.accessRightsAccess limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan thorugh a local library, pending author/copyright holder's permission.
dcterms.accessRightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Parameshwarappa_umbc_0434D_12170.pdf
Size:
2.12 MB
Format:
Adobe Portable Document Format