Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-speaker, Multi-attack (4M) Scenarios

Khanjani, Zahra; Janeja, Vandana; Mallinson, Christine; Purushotham, Sanjay

Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-speaker, Multi-attack (4M) Scenarios

dc.contributor.author	Khanjani, Zahra
dc.contributor.author	Janeja, Vandana
dc.contributor.author	Mallinson, Christine
dc.contributor.author	Purushotham, Sanjay
dc.date.accessioned	2025-06-17T14:44:59Z
dc.date.available	2025-06-17T14:44:59Z
dc.date.issued	2025
dc.description	2025 SIAM International Conference on Data Mining (SDM25) May 1 - 3, Alexandria, Virginia
dc.description.abstract	Audio spoofing has surged with the rise of generative artificial intelligence, posing a serious threat to online communication. Recent studies have shown promising avenues in detecting spoofed audio specifically those that use human expert knowledge in representation learning, but more work is needed to evaluate performance across various realistic scenarios that tend to pose challenges in spoofed audio detection. In this paper, we introduce a comprehensive framework for expert-in-the-loop representation learning for audio anti-spoofing that is robust enough to address four specific challenging scenarios. Multimodal, Multilingual, Multi-speaker, and Multi-attack (4M). Preliminary results demonstrate the framework’s potential effectiveness in audio anti-spoofing.
dc.description.sponsorship	Authors would like to acknowledge NSF award 2346473 and 2210011 Authors would like to ac knowledge the contribution of Pragya Pandit for designing the infographic
dc.description.uri	https://epubs.siam.org/doi/10.1137/1.9781611978520.32
dc.format.extent	4 pages
dc.genre	conference papers and proceedings
dc.identifier	doi:10.13016/m2fe5s-foeu
dc.identifier.citation	Khanjani, Zahra, Vandana P. Janeja, Christine Mallinson, and Sanjay Purushotham. “Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-Speaker, Multi-Attack (4M) Scenarios.” In Proceedings of the 2025 SIAM International Conference on Data Mining (SDM), 327–30. Proceedings. Society for Industrial and Applied Mathematics, 2025. https://doi.org/10.1137/1.9781611978520.32.
dc.identifier.uri	https://doi.org/10.1137/1.9781611978520.32
dc.identifier.uri	http://hdl.handle.net/11603/38817
dc.language.iso	en_US
dc.publisher	SIAM International Conference on Data Mining
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Center for Social Science Scholarship
dc.relation.ispartof	UMBC Information Systems Department
dc.relation.ispartof	UMBC Office for the Vice President of Research
dc.relation.ispartof	UMBC Language, Literacy, and Culture Department
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	© 2025 Society for Industrial and Applied Mathematics
dc.subject	Multilingual
dc.subject	UMBC Cybersecurity Institute
dc.subject	generative arti-ficial intelligence
dc.subject	Multi-attack (4M )Scenarios
dc.subject	Audio spoofing
dc.subject	Multimodal
dc.subject	UMBC M
dc.subject	Multi-speaker
dc.title	Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-speaker, Multi-attack (4M) Scenarios
dc.type	Text
dcterms.creator	https://orcid.org/0000-0003-0130-6135

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 1.9781611978520.32.pdf
Size:: 759.17 KB
Format:: Adobe Portable Document Format

Download

Collections

UMBC Center for Social Science Scholarship
UMBC Faculty Collection
UMBC Information Systems Department
UMBC Language, Literacy, and Culture Department
UMBC Office for the Vice President of Research & Creative Achievement (ORCA)