GenAIPABench: A Benchmark for Generative AI-based Privacy Assistants

Hamid, Aamir; Reddy Samidi, Hemanth; Finin, Tim; Pappachan, Primal; Yus, Roberto

dc.contributor.author	Hamid, Aamir
dc.contributor.author	Reddy Samidi, Hemanth
dc.contributor.author	Finin, Tim
dc.contributor.author	Pappachan, Primal
dc.contributor.author	Yus, Roberto
dc.date.accessioned	2023-09-27T15:35:51Z
dc.date.available	2023-09-27T15:35:51Z
dc.date.issued	2024
dc.description	Proceedings on Privacy Enhancing Technologies 2024(3)
dc.description.abstract	Website privacy policies are often lengthy and intricate. Privacy assistants help simplify these policies and make them more accessible and user-friendly. The emergence of generative AI (genAI) offers new opportunities to build privacy assistants that can answer users’ questions about privacy policies. However, genAI’s reliability is a concern because it can produce inaccurate information. This study introduces GenAIPABench, a benchmark for evaluating Generative AI-based Privacy Assistants (GenAIPAs). GenAIPABench includes: 1) a set of curated questions about privacy policies, along with annotated answers, for various organizations and regulations; 2) metrics to assess the accuracy, relevance, and consistency of responses; and 3) a tool for generating prompts that introduce privacy policies and paraphrased variants of the curated questions. We evaluated three leading genAI systems (ChatGPT-4, Bard, and Bing AI) using GenAIPABench to gauge their effectiveness as GenAIPAs. Our results show that genAI holds significant promise in the privacy domain, while also highlighting challenges in managing complex queries, ensuring consistency, and verifying source accuracy.	en
dc.description.uri	https://petsymposium.org/popets/2024/popets-2024-0081.php	en
dc.format.extent	17 pages	en
dc.genre	conference papers and proceedings
dc.genre	journal articles	en
dc.genre	presentations (communicative events)	en
dc.identifier	doi:10.13016/m29yec-3iy3
dc.identifier.citation	Hamid, Aamir, Hemanth Reddy Samidi, Primal Pappachan, Tim Finin, and Roberto Yus. “GenAIPABench: A Benchmark for Generative AI-Based Privacy Assistants.” Proceedings on Privacy Enhancing Technologies, 2024. https://petsymposium.org/popets/2024/popets-2024-0081.php.
dc.identifier.uri	https://doi.org/10.56553/popets-2024-0081
dc.identifier.uri	http://hdl.handle.net/11603/29899
dc.language.iso	en	en
dc.publisher	Proceedings on Privacy Enhancing Technologies
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	Attribution 4.0 International (CC BY 4.0)	*
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	*
dc.subject	UMBC Ebiquity Research Group
dc.title	GenAIPABench: A Benchmark for Generative AI-based Privacy Assistants	en
dc.type	Text	en
dcterms.creator	https://orcid.org/0000-0002-6593-1792	en
dcterms.creator	https://orcid.org/0000-0002-9311-954X	en