Measuring Semantic Similarity across EU GDPR Regulation and Cloud Privacy Policies

dc.contributor.authorElluri, Lavanya
dc.contributor.authorJoshi, Karuna Pande
dc.contributor.authorKotal, Anantaa
dc.date.accessioned2020-12-14T17:08:21Z
dc.date.available2020-12-14T17:08:21Z
dc.date.issued2020-12-13
dc.description7th International Workshop on Privacy and Security of Big Data (PSBD 2020), in conjunction with 2020 IEEE International Conference on Big Data (IEEE BigData 2020)en_US
dc.description2020 IEEE International Conference on Big Data (Big Data), 10-13 December 2020, Atlanta, GA, USA
dc.description.abstractData protection authorities formulate policies and rules which the service providers have to comply with to ensure security and privacy when they perform Big Data analytics using users Personally Identifiable Information (PII). The knowledge contained in the data regulations and organizational privacy policies are typically maintained as short unstructured text in HTML or PDF formats. Hence it is an open challenge to determine the specific regulation rules that are being addressed by a provider’s privacy policies. We have developed a semantically rich framework, using techniques from Semantic Web and Natural Language Processing, to extract and compare the context of a short text in real-time. This framework allows automated incremental text comparison and identifying context from short text policy documents by determining the semantic similarity score and extracting semantically similar key terms. Additionally, we also created a knowledge graph to store the semantically similar comparison results while evaluating our framework across EU GDPR and privacy policies of 20 organizations complying with this regulation associated with various categories apply to Big Data stored in the cloud. Our approach can be utilized by Big Data practitioners to update their referential documents regularly based on the authority documents.en_US
dc.description.urihttps://ieeexplore.ieee.org/document/9377864en_US
dc.format.extent10 pagesen_US
dc.genreconference papers and proceedings preprintsen_US
dc.identifierdoi:10.13016/m2mbe2-jyii
dc.identifier.citationL. Elluri, K. Pande Joshi and A. Kotal, "Measuring Semantic Similarity across EU GDPR Regulation and Cloud Privacy Policies," 2020 IEEE International Conference on Big Data (Big Data), 2020, pp. 3963-3978, doi: 10.1109/BigData50022.2020.9377864.en_US
dc.identifier.urihttp://hdl.handle.net/11603/20257
dc.identifier.urihttps://doi.org/10.1109/BigData50022.2020.9377864
dc.language.isoen_USen_US
dc.publisherIEEEen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights© 2020 IEEE.  Personal use of this material is permitted.  Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.subjectUMBC Ebiquity Research Group
dc.titleMeasuring Semantic Similarity across EU GDPR Regulation and Cloud Privacy Policiesen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1054.pdf
Size:
1.42 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: