PromptGuard at BLP-2025 Task 1: A Few-Shot Classification Framework Using Majority Voting and Keyword Similarity for Bengali Hate Speech Detection

Hossan, Rakib; Roy Dipta, Shubhashis

dc.contributor.author	Hossan, Rakib
dc.contributor.author	Roy Dipta, Shubhashis
dc.date.accessioned	2025-11-21T00:29:46Z
dc.date.issued	2025-10-10
dc.description.abstract	BLP-2025 Task 1A requires classifying Bengali hate speech into six categories. Traditional supervised approaches need extensive labeled datasets, which are expensive to build for low-resource languages. We developed PromptGuard, a few-shot framework that combines chi-square statistical analysis for keyword extraction with adaptive majority voting for decision-making. We compare statistical keyword selection against random selection and examine adaptive voting mechanisms that extend classification based on consensus quality. Chi-square keywords provide consistent improvements across categories, while adaptive voting benefits ambiguous cases that require extended classification rounds. PromptGuard achieves a micro-F1 of 67.61, outperforming n-gram baselines (60.75) and random approaches (14.65). Ablation studies confirm that chi-square-based keywords have the most consistent impact across all categories.
dc.description.uri	http://arxiv.org/abs/2510.09771
dc.format.extent	7 pages
dc.genre	journal articles
dc.genre	preprints
dc.identifier	doi:10.13016/m2cjyw-tmp1
dc.identifier.uri	https://doi.org/10.48550/arXiv.2510.09771
dc.identifier.uri	http://hdl.handle.net/11603/40793
dc.language.iso	en
dc.language.iso	bn
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Student Collection
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject	UMBC Interactive Robotics and Language Lab
dc.subject	Computer Science - Computation and Language
dc.subject	Computer Science - Artificial Intelligence
dc.title	PromptGuard at BLP-2025 Task 1: A Few-Shot Classification Framework Using Majority Voting and Keyword Similarity for Bengali Hate Speech Detection
dc.type	Text
dcterms.creator	https://orcid.org/0000-0002-9176-1782
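
The abstract names two mechanisms: chi-square keyword extraction and adaptive majority voting. The sketch below is only an illustration of those two ideas under stated assumptions, not the authors' implementation. The function names, the whitespace tokenizer, the toy English examples and labels, and the parameters `base_rounds`, `max_rounds`, and `min_agreement` are all hypothetical; the record does not specify them. The `classify` argument stands in for whatever few-shot model call produces one label per round.

```python
from collections import Counter


def chi_square_keywords(docs, labels, target, top_k=3):
    """Rank tokens by a 2x2 chi-square score against the `target` label.

    Assumption: whitespace tokenization and document-level presence counts.
    Only tokens that occur more often inside the target class are kept.
    """
    n = len(docs)
    n_pos = sum(1 for y in labels if y == target)
    n_neg = n - n_pos
    pos_df, neg_df = Counter(), Counter()
    for text, y in zip(docs, labels):
        tokens = set(text.split())
        (pos_df if y == target else neg_df).update(tokens)

    scores = {}
    for tok in set(pos_df) | set(neg_df):
        a = pos_df[tok]   # target docs containing the token
        b = neg_df[tok]   # other docs containing the token
        c = n_pos - a     # target docs without the token
        d = n_neg - b     # other docs without the token
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        if denom == 0 or a * d <= b * c:
            continue      # undefined score or no positive association
        scores[tok] = n * (a * d - b * c) ** 2 / denom
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda t: (-scores[t], t))[:top_k]


def adaptive_majority_vote(classify, text, base_rounds=3, max_rounds=7,
                           min_agreement=0.6):
    """Collect votes until the leading label is agreed on strongly enough.

    Assumption: "consensus quality" is modeled as the leading label's share
    of all votes so far; voting is extended past `base_rounds` while that
    share stays below `min_agreement`, up to `max_rounds`.
    """
    votes = []
    while len(votes) < max_rounds:
        votes.append(classify(text))
        if len(votes) >= base_rounds:
            label, count = Counter(votes).most_common(1)[0]
            if count / len(votes) >= min_agreement:
                return label, len(votes)
    label, _ = Counter(votes).most_common(1)[0]
    return label, len(votes)


if __name__ == "__main__":
    # Toy placeholder data, not drawn from the BLP-2025 dataset.
    docs = ["you are stupid idiot", "stupid people everywhere",
            "nice weather today", "lovely people today"]
    labels = ["abusive", "abusive", "none", "none"]
    print(chi_square_keywords(docs, labels, "abusive", top_k=1))

    scripted = iter(["abusive", "none", "abusive", "abusive"])
    print(adaptive_majority_vote(lambda t: next(scripted), "some text",
                                 min_agreement=0.7))
```

In this sketch a clear-cut input stops after `base_rounds` votes, while an ambiguous one triggers extra rounds, which mirrors the abstract's description of extending classification when consensus is weak.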

Files

Original bundle

Name: 251009771v1.pdf
Size: 306.32 KB
Format: Adobe Portable Document Format