PromptGuard at BLP-2025 Task 1: A Few-Shot Classification Framework Using Majority Voting and Keyword Similarity for Bengali Hate Speech Detection

dc.contributor.author: Hossan, Rakib
dc.contributor.author: Roy Dipta, Shubhashis
dc.date.accessioned: 2025-11-21T00:29:46Z
dc.date.issued: 2025-10-10
dc.description.abstract: The BLP-2025 Task 1A requires Bengali hate speech classification into six categories. Traditional supervised approaches need extensive labeled datasets that are expensive for low-resource languages. We developed PromptGuard, a few-shot framework combining chi-square statistical analysis for keyword extraction with adaptive majority voting for decision-making. We explore statistical keyword selection versus random approaches and adaptive voting mechanisms that extend classification based on consensus quality. Chi-square keywords provide consistent improvements across categories, while adaptive voting benefits ambiguous cases requiring extended classification rounds. PromptGuard achieves a micro-F1 of 67.61, outperforming n-gram baselines (60.75) and random approaches (14.65). Ablation studies confirm chi-square-based keywords show the most consistent impact across all categories.
dc.description.uri: http://arxiv.org/abs/2510.09771
dc.format.extent: 7 pages
dc.genre: journal articles
dc.genre: preprints
dc.identifier: doi:10.13016/m2cjyw-tmp1
dc.identifier.uri: https://doi.org/10.48550/arXiv.2510.09771
dc.identifier.uri: http://hdl.handle.net/11603/40793
dc.language.iso: en
dc.language.iso: bn
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Student Collection
dc.relation.ispartof: UMBC Computer Science and Electrical Engineering Department
dc.rights: This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject: UMBC Interactive Robotics and Language Lab
dc.subject: Computer Science - Computation and Language
dc.subject: Computer Science - Artificial Intelligence
dc.title: PromptGuard at BLP-2025 Task 1: A Few-Shot Classification Framework Using Majority Voting and Keyword Similarity for Bengali Hate Speech Detection
dc.type: Text
dcterms.creator: https://orcid.org/0000-0002-9176-1782
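
The abstract names two techniques: chi-square keyword selection and adaptive majority voting. The following is a minimal illustrative sketch of how such components might look, not the authors' implementation. All function names, the tokenized-input format, and the parameters `min_rounds`, `max_rounds`, and `threshold` are assumptions made for illustration; the record does not specify them. The `classify` argument stands in for any single-shot classifier call, such as one few-shot prompt to a language model.

```python
from collections import Counter


def chi_square_score(docs, labels, term, category):
    """Chi-square statistic of a 2x2 table: term presence vs. category membership.

    docs is a list of token lists; labels is the parallel list of category names.
    """
    a = b = c = d = 0
    for tokens, label in zip(docs, labels):
        has_term = term in tokens
        in_category = label == category
        if has_term and in_category:
            a += 1          # term present, in category
        elif has_term:
            b += 1          # term present, other category
        elif in_category:
            c += 1          # term absent, in category
        else:
            d += 1          # term absent, other category
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        return 0.0          # degenerate table: no evidence either way
    return n * (a * d - b * c) ** 2 / denom


def top_keywords(docs, labels, category, k=5):
    """Return the k terms with the highest chi-square score for a category.

    The statistic is symmetric, so strongly negative associations also score
    high; a real system might filter those out.
    """
    vocab = {t for tokens in docs for t in tokens}
    ranked = sorted(
        vocab, key=lambda t: (-chi_square_score(docs, labels, t, category), t)
    )
    return ranked[:k]


def adaptive_majority_vote(classify, text, min_rounds=3, max_rounds=7, threshold=0.6):
    """Vote until the leading label's share reaches threshold, or rounds run out.

    Returns (label, rounds_used). Ambiguous inputs consume more rounds.
    """
    votes = Counter()
    for round_no in range(1, max_rounds + 1):
        votes[classify(text)] += 1
        if round_no >= min_rounds:
            label, count = votes.most_common(1)[0]
            if count / round_no >= threshold:
                return label, round_no
    return votes.most_common(1)[0][0], max_rounds
```

In this sketch, a clear-cut input stops after `min_rounds` calls, while a contested one keeps voting up to `max_rounds`, which mirrors the abstract's description of extending classification based on consensus quality.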

Files

Original bundle

Name: 251009771v1.pdf
Size: 306.32 KB
Format: Adobe Portable Document Format