Chen, ZhiyuanKoru, GunesParameshwarappa, Pooja2021-09-012021-09-012020-01-0112170http://hdl.handle.net/11603/22905In the current IoT era, collection of activity data such as physical and daily activity data has become ubiquitous. Publishing activity data can facilitate personal and population health management and promote reproducible health care research. However, publishing such data can also bring high privacy risks including re-identification of individuals in the data set. Therefore, there is a growing need for anonymizing the data before publishing. One of the challenges in anonymizing sequential data such as activity data is its high-dimensional nature. Although existing techniques work sufficiently for cross-sectional data, they result in low run-time performance when applied directly to sequential data. In this research, we propose Multi-level Clustering (MC) based anonymization approaches that apply k-anonymity, differential privacy, and l-diversity privacy models. The proposed MC step improves the performance of the anonymization approaches by reducing the clustering time drastically. Results show that the proposed approaches in addition to being more efficient than the existing approaches, also preserve the utility of the data as much as the existing approaches.application:pdfActivity dataAnonymizationClusteringHigh-dimensional dataLongitudinal dataPrivacyClustering Approaches for Anonymizing High-Dimensional Sequential Activity DataText