Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
| dc.contributor.author | Mohammadi, Ali | |
| dc.contributor.author | Vedula, Bhaskara Hanuma | |
| dc.contributor.author | Lamba, Hemank | |
| dc.contributor.author | Raff, Edward | |
| dc.contributor.author | Kumaraguru, Ponnurangam | |
| dc.contributor.author | Ferraro, Francis | |
| dc.contributor.author | Gaur, Manas | |
| dc.date.accessioned | 2025-10-22T19:58:09Z | |
| dc.date.issued | 2025-09-02 | |
| dc.description | The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025),November 4th - 9th, 2025,Suzhou, China | |
| dc.description.abstract | Do LLMs genuinely incorporate external definitions, or do they primarily rely on their parametric knowledge? To address these questions, we conduct controlled experiments across multiple explanation benchmark datasets (general and domain-specific) and label definition conditions, including expert-curated, LLM-generated, perturbed, and swapped definitions. Our results reveal that while explicit label definitions can enhance accuracy and explainability, their integration into an LLM's task-solving processes is neither guaranteed nor consistent, suggesting reliance on internalized representations in many cases. Models often default to their internal representations, particularly in general tasks, whereas domain-specific tasks benefit more from explicit definitions. These findings underscore the need for a deeper understanding of how LLMs process external knowledge alongside their pre-existing capabilities. | |
| dc.description.uri | http://arxiv.org/abs/2509.02452 | |
| dc.format.extent | 13 pages | |
| dc.genre | conference papers and proceedings | |
| dc.genre | preprints | |
| dc.identifier | doi:10.13016/m2wgue-hh3r | |
| dc.identifier.uri | https://doi.org/10.48550/arXiv.2509.02452 | |
| dc.identifier.uri | http://hdl.handle.net/11603/40548 | |
| dc.language.iso | en | |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Data Science | |
| dc.relation.ispartof | UMBC Student Collection | |
| dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department | |
| dc.relation.ispartof | UMBC Faculty Collection | |
| dc.rights | Attribution 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | UMBC Interactive Robotics and Language Lab | |
| dc.subject | UMBC Discovery, Research, and Experimental Analysis of Malware Lab (DREAM Lab) | |
| dc.subject | UMBC Interactive Robotics and Language Lab (IRAL Lab) | |
| dc.subject | Computer Science - Machine Learning | |
| dc.subject | Computer Science - Computation and Language | |
| dc.subject | UMBC Ebiquity Research Group | |
| dc.subject | Computer Science - Artificial Intelligence | |
| dc.title | Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions | |
| dc.type | Text | |
| dcterms.creator | https://orcid.org/0000-0002-9900-1972 | |
| dcterms.creator | https://orcid.org/0000-0003-2413-9368 | |
| dcterms.creator | https://orcid.org/0000-0002-5411-2230 |
Files
Original bundle
1 - 1 of 1
