Investigating Symbolic Triggers of Hallucination in Gemma Models Across HaluEval and TruthfulQA

dc.contributor.authorLamba, Naveen
dc.contributor.authorTiwari, Sanju
dc.contributor.authorGaur, Manas
dc.date.accessioned2025-09-18T14:22:25Z
dc.date.issued2025-09
dc.descriptionSymGenAI4Sci 2025: First International Workshop on Symbolic and Generative AI for Science, September 3–5, 2025, Vienna, Austria
dc.description.abstractHallucination in Large Language Models(LLMs) is a well studied problem. However, the properties that make LLM intrinsically vulnerable to hallucinations have not been identified and studied. This research identifies and characterizes the key properties, allowing us to pinpoint vulnerabilities within the model’s internal mechanisms. To solidify on these properties, we utilized two established datasets, HaluEval and TruthfulQA and convert their existing format of question answering into various other formats to narrow down these properties as the reason for the hallucinations. Our findings reveal that hallucination percentages across symbolic properties are notably high for Gemma-2-2B, averaging 79.0% across tasks and datasets. With increased model scale, hallucination drops to 73.6% for Gemma-2-9B and 63.9% for Gemma-2-27B, reflecting a 15 percentage point reduction overall. Although the hallucination rate decreases as the model size increases, a substantial amount of hallucination caused by symbolic properties still persists. This is especially evident for modifiers (ranging from 84.76% to 94.98%) and named entities (ranging from 83.87% to 93.96%) across all Gemma models and both datasets. These findings indicate that symbolic elements continue to confuse the models, pointing to a fundamental weakness in how these LLMs process such inputs—regardless of their scale.
dc.description.urihttps://2025-eu.semantics.cc/
dc.format.extent13 pages
dc.genreconference papers and proceedings
dc.genrepostprints
dc.identifierdoi:10.13016/m2zrwq-xjv5
dc.identifier.urihttp://hdl.handle.net/11603/40241
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Faculty Collection
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectUMBC Ebiquity Research Group
dc.titleInvestigating Symbolic Triggers of Hallucination in Gemma Models Across HaluEval and TruthfulQA
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-5411-2230

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Investigating_Symbolic_Triggers_of_Hallucination_in_Gemma_Models.pdf
Size:
1.07 MB
Format:
Adobe Portable Document Format