Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-speaker, Multi-attack (4M) Scenarios

dc.contributor.authorKhanjani, Zahra
dc.contributor.authorJaneja, Vandana
dc.contributor.authorMallinson, Christine
dc.contributor.authorPurushotham, Sanjay
dc.date.accessioned2025-06-17T14:44:59Z
dc.date.available2025-06-17T14:44:59Z
dc.date.issued2025
dc.description2025 SIAM International Conference on Data Mining (SDM25) May 1 - 3, Alexandria, Virginia
dc.description.abstractAudio spoofing has surged with the rise of generative artificial intelligence, posing a serious threat to online communication. Recent studies have shown promising avenues in detecting spoofed audio specifically those that use human expert knowledge in representation learning, but more work is needed to evaluate performance across various realistic scenarios that tend to pose challenges in spoofed audio detection. In this paper, we introduce a comprehensive framework for expert-in-the-loop representation learning for audio anti-spoofing that is robust enough to address four specific challenging scenarios. Multimodal, Multilingual, Multi-speaker, and Multi-attack (4M). Preliminary results demonstrate the framework’s potential effectiveness in audio anti-spoofing.
dc.description.sponsorshipAuthors would like to acknowledge NSF award 2346473 and 2210011 Authors would like to ac knowledge the contribution of Pragya Pandit for designing the infographic
dc.description.urihttps://epubs.siam.org/doi/10.1137/1.9781611978520.32
dc.format.extent4 pages
dc.genreconference papers and proceedings
dc.identifier.citationKhanjani, Zahra, Vandana P. Janeja, Christine Mallinson, and Sanjay Purushotham. “Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-Speaker, Multi-Attack (4M) Scenarios.” In Proceedings of the 2025 SIAM International Conference on Data Mining (SDM), 327–30. Proceedings. Society for Industrial and Applied Mathematics, 2025. https://doi.org/10.1137/1.9781611978520.32.
dc.identifier.urihttps://doi.org/10.1137/1.9781611978520.32
dc.identifier.urihttp://hdl.handle.net/11603/38817
dc.language.isoen_US
dc.publisherSIAM International Conference on Data Mining
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Center for Social Science Scholarship
dc.relation.ispartofUMBC Information Systems Department
dc.relation.ispartofUMBC Office for the Vice President of Research
dc.relation.ispartofUMBC Language, Literacy, and Culture Department
dc.relation.ispartofUMBC Faculty Collection
dc.rights© 2025 Society for Industrial and Applied Mathematics
dc.subjectMultilingual
dc.subjectUMBC Cybersecurity Institute
dc.subjectgenerative arti-ficial intelligence
dc.subjectMulti-attack (4M )Scenarios
dc.subjectAudio spoofing
dc.subjectMultimodal
dc.subjectUMBC M
dc.subjectMulti-speaker
dc.titleBlue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-speaker, Multi-attack (4M) Scenarios
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0003-0130-6135

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1.9781611978520.32.pdf
Size:
759.17 KB
Format:
Adobe Portable Document Format