Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-speaker, Multi-attack (4M) Scenarios
dc.contributor.author | Khanjani, Zahra | |
dc.contributor.author | Janeja, Vandana | |
dc.contributor.author | Mallinson, Christine | |
dc.contributor.author | Purushotham, Sanjay | |
dc.date.accessioned | 2025-06-17T14:44:59Z | |
dc.date.available | 2025-06-17T14:44:59Z | |
dc.date.issued | 2025 | |
dc.description | 2025 SIAM International Conference on Data Mining (SDM25) May 1 - 3, Alexandria, Virginia | |
dc.description.abstract | Audio spoofing has surged with the rise of generative artificial intelligence, posing a serious threat to online communication. Recent studies have shown promising avenues in detecting spoofed audio specifically those that use human expert knowledge in representation learning, but more work is needed to evaluate performance across various realistic scenarios that tend to pose challenges in spoofed audio detection. In this paper, we introduce a comprehensive framework for expert-in-the-loop representation learning for audio anti-spoofing that is robust enough to address four specific challenging scenarios. Multimodal, Multilingual, Multi-speaker, and Multi-attack (4M). Preliminary results demonstrate the framework’s potential effectiveness in audio anti-spoofing. | |
dc.description.sponsorship | Authors would like to acknowledge NSF award 2346473 and 2210011 Authors would like to ac knowledge the contribution of Pragya Pandit for designing the infographic | |
dc.description.uri | https://epubs.siam.org/doi/10.1137/1.9781611978520.32 | |
dc.format.extent | 4 pages | |
dc.genre | conference papers and proceedings | |
dc.identifier.citation | Khanjani, Zahra, Vandana P. Janeja, Christine Mallinson, and Sanjay Purushotham. “Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-Speaker, Multi-Attack (4M) Scenarios.” In Proceedings of the 2025 SIAM International Conference on Data Mining (SDM), 327–30. Proceedings. Society for Industrial and Applied Mathematics, 2025. https://doi.org/10.1137/1.9781611978520.32. | |
dc.identifier.uri | https://doi.org/10.1137/1.9781611978520.32 | |
dc.identifier.uri | http://hdl.handle.net/11603/38817 | |
dc.language.iso | en_US | |
dc.publisher | SIAM International Conference on Data Mining | |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Center for Social Science Scholarship | |
dc.relation.ispartof | UMBC Information Systems Department | |
dc.relation.ispartof | UMBC Office for the Vice President of Research | |
dc.relation.ispartof | UMBC Language, Literacy, and Culture Department | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.rights | © 2025 Society for Industrial and Applied Mathematics | |
dc.subject | Multilingual | |
dc.subject | UMBC Cybersecurity Institute | |
dc.subject | generative arti-ficial intelligence | |
dc.subject | Multi-attack (4M )Scenarios | |
dc.subject | Audio spoofing | |
dc.subject | Multimodal | |
dc.subject | UMBC M | |
dc.subject | Multi-speaker | |
dc.title | Blue Sky: Expert-in-the-Loop Representation Learning Framework for Audio Anti-Spoofing: Multimodal, Multilingual, Multi-speaker, Multi-attack (4M) Scenarios | |
dc.type | Text | |
dcterms.creator | https://orcid.org/0000-0003-0130-6135 |
Files
Original bundle
1 - 1 of 1