PromptGuard at BLP-2025 Task 1: A Few-Shot Classification Framework Using Majority Voting and Keyword Similarity for Bengali Hate Speech Detection
| dc.contributor.author | Hossan, Rakib | |
| dc.contributor.author | Roy Dipta, Shubhashis | |
| dc.date.accessioned | 2025-11-21T00:29:46Z | |
| dc.date.issued | 2025-10-10 | |
| dc.description.abstract | The BLP-2025 Task 1A requires Bengali hate speech classification into six categories. Traditional supervised approaches need extensive labeled datasets that are expensive for low-resource languages. We developed PromptGuard, a few-shot framework combining chi-square statistical analysis for keyword extraction with adaptive majority voting for decision-making. We explore statistical keyword selection versus random approaches and adaptive voting mechanisms that extend classification based on consensus quality. Chi-square keywords provide consistent improvements across categories, while adaptive voting benefits ambiguous cases requiring extended classification rounds. PromptGuard achieves a micro-F1 of 67.61, outperforming n-gram baselines (60.75) and random approaches (14.65). Ablation studies confirm chi-square-based keywords show the most consistent impact across all categories. | |
| dc.description.uri | http://arxiv.org/abs/2510.09771 | |
| dc.format.extent | 7 pages | |
| dc.genre | journal articles | |
| dc.genre | preprints | |
| dc.identifier | doi:10.13016/m2cjyw-tmp1 | |
| dc.identifier.uri | https://doi.org/10.48550/arXiv.2510.09771 | |
| dc.identifier.uri | http://hdl.handle.net/11603/40793 | |
| dc.language.iso | en | |
| dc.language.iso | bn | |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Student Collection | |
| dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department | |
| dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
| dc.subject | UMBC Interactive Robotics and Language Lab | |
| dc.subject | Computer Science - Computation and Language | |
| dc.subject | Computer Science - Artificial Intelligence | |
| dc.title | PromptGuard at BLP-2025 Task 1: A Few-Shot Classification Framework Using Majority Voting and Keyword Similarity for Bengali Hate Speech Detection | |
| dc.type | Text | |
| dcterms.creator | https://orcid.org/0000-0002-9176-1782 |
Files
Original bundle
1 - 1 of 1
